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Abstract 

This paper presents a large-system analysis of the performance of joint channel estimation, multiuser detection, and 
per-user decoding (CE-MUDD) for randomly-spread multiple-input multiple-output (MIMO) direct-sequence code- 
division multiple-access (DS-CDMA) systems. A suboptimal receiver based on successive decoding in conjunction 
with linear minimum mean-squared error (LMMSE) channel estimation is investigated. The replica method, developed 
in statistical mechanics, is used to evaluate the performance in the large-system limit, where the number of users 
and the spreading factor tend to infinity while their ratio and the number of transmit and receive antennas are kept 
constant. The performance of the joint CE-MUDD based on LMMSE channel estimation is compared to the spectral 
efficiencies of several receivers based on one-shot LMMSE channel estimation, in which the decoded data symbols 
are not utilized to refine the initial channel estimates. The results imply that the use of joint CE-MUDD significantly 
reduces rate loss due to transmission of pilot signals, especially for multiple-antenna systems. As a result, joint CE- 
MUDD can provide significant performance gains, compared to the receivers based on one-shot channel estimation. 
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Index Terms 

Multiple-input multiple-output (MIMO) systems, direct-sequence code-division multiple-access (DS-CDMA) schemes, 
channel estimation, multiuser detection (MUD), linear minimum mean-squared error (LMMSE) estimation, iterative 
receivers, successive decoding, large-system analysis, replica method, statistical mechanics. 

I. Introduction 

Direct-sequence code-division multiple-access (DS-CDMA) schemes are used in the air interface of third-generation 
(3G) mobile communication systems [1|-[3|. In order to improve the spectral efficiency of DS-CDMA systems, 
the extension to multiple-input multiple-output (MIMO) DS-CDMA systems has been actively considered ||4)- 
|fl~3l . As a drawback of using multiple antennas at the transmitters, the receiver structure of MIMO DS-CDMA 
systems becomes more complex than that of conventional DS-CDMA systems. Therefore, it is important in MIMO 
DS-CDMA systems to construct receivers achieving an acceptable tradeoff between performance and complexity. 

A goal in this research area is to construct receivers which achieve near-optimal performance by using acceptable 
computational costs, since optimal joint decoding is infeasible in terms of complexity. Separation of detection and 
decoding significantly reduces the complexity of the receiver, although it is suboptimal in terms of performance. In 
multiuser detection (MUD) fl4l . certain statistical properties of multiple-access interference (MAI) are used to detect 
data symbols. Large-system analysis is a powerful approach for evaluating the performance of MUD followed by 
per-user decoding for randomly-spread DS-CDMA systems. In this analysis an asymptotic limit is assumed, referred 
to as the large-system limit, in which the number of users and the spreading factor tend to infinity while their ratio is 
kept constant. A series of large-system analyses [ 15 1-[20| have revealed that the linear minimum mean-squared error 
(LMMSE) detection followed by per-user decoding achieves near-optimal performance for lightly-loaded systems 
with perfect channel state information (CSI) at the receiver. Furthermore, it has been shown numerically ll2TI - ll24ll 
and analytically ll25l . ll26l that iterative LMMSE-based multiuser detection and per-user decoding (MUDD) can 
achieve near-optimal performance even for highly-loaded systems with perfect CSI at the receiver. See |4j, lfl2l . 
ifTJl for the extension of these results to MIMO DS-CDMA systems with perfect CSI at the receiver. In the large- 
system analysis for MIMO DS-CDMA systems the numbers of transmit and receive antennas are commonly fixed, 
while the number of users and the spreading factor tend to infinity with their ratio fixed. Note that a many-antenna 
limit, in which the numbers of transmit and receive antennas tend to infinity, may be taken after this large-system 
limit [27 1 ■ In this paper, the numbers of transmit and receive antennas are kept finite. 

It is worth considering the no-CSI case since CSI is unknown in advance for practical MIMO DS-CDMA systems. 
Iterative MUDD was extended to iterative channel estimation (CE) and MUDD (CE-MUDD) in [28 1. In this scheme, 
the channel estimator utilizes soft feedback from the per-user decoders for refining the initial channel estimates. In 
terms of complexity, iterative CE-MUDD requires updating the filter coefficients of the channel estimator and the 
detector in every iteration. This implies that the computational complexity of iterative CE-MUDD is higher than 
that for receivers based on one-shot channel estimation, in which data estimation is performed without refining the 
initial channel estimates. In order to reduce the complexity of iterative CE-MUDD, linear channel estimators, such 
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as the LMMSE channel estimator, have been commonly used. Linear channel estimators use known pilot symbols 
to obtain the initial channel estimates, while the optimal channel estimator can attain the initial channel estimates 
with no pilot symbols. Numerical simulations [ 28 ]— 1 30] demonstrated that iterative LMMSE-based CE-MUDD 
can provide significant performance gains for highly-loaded systems, compared to receivers based on one-shot 
channel estimation. However, these works did not discuss how to design the length of symbol periods assigned 
for transmission of pilot symbols, called training phase. Increasing the length of the training phase improves the 
accuracy of channel estimation, while transmission of pilot symbols reduces the transmission rate. Thus, how to 
design the length of the training phase should provide a great impact on achievable spectral efficiency for the no-CSI 
case. The goal of this paper is to optimize the length of the training phase on the basis of information-theoretical 
capacities. 

Simple modulation schemes, such as quadrature phase shift keying (QPSK) or quadrature amplitude modulation 
(QAM), are commonly used for coherent receivers that estimate CSI explicitly. Thus, the channel capacity for a 
fixed modulation scheme, called constrained capacity, corresponds to a performance bouncQ for iterative receivers, 
while the true capacity might be achieved by using complicated modulation [ 3 1 1 - It is a challenging issue to derive 
analytical formulas for the constrained capacities of wireless communication systems with no CSI, since the optimal 
channel estimator is nonlinear. Lower bounds for the constrained capacities have been derived instead [32], [33]. 
The basic idea in these works for obtaining lower bounds is to replace the noise due to channel estimation errors by 
additive white Gaussian noise (AWGN). This replacement reduces optimal nonlinear channel estimation to LMMSE 
channel estimation. In this paper, this type of a lower bound is referred to as a lower bound based on LMMSE 
channel estimation. 

We derive a lower bound for the constrained capacity of randomly-spread MIMO DS-CDMA systems with 
no CSI, on the basis of LMMSE channel estimation. The lower bound can be regarded as a performance index 
for iterative CE-MUDD based on LMMSE channel estimation. In an information-theoretical point of view, both 
LMMSE channel estimates and covariances of the estimation errors should be used in MUD [34], while the use 
of the covariances is uncommon in practice. In this paper, we consider suboptimal LMMSE channel estimation in 
which only a part of the covariances for the LMMSE estimation errors are used, while all covariances are used 
in the true LMMSE channel estimation. The use of suboptimal LMMSE channel estimation allows us to evaluate 
a lower bound for the constrained capacity. We refer to this lower bound as the performance of the joint CE- 
MUDD based on suboptimal LMMSE channel estimation, or simply as the performance of the joint CE-MUDD. In 
order to investigate the benefits obtained by using iterative CE-MUDD, we also analyze the performance of three 
receivers based on one-shot LMMSE channel estimation: one-shot CE-MUDD, an optimum separated receiver, and 
an LMMSE receiver. Table U lists the four receivers considered in this paper. The one-shot CE-MUDD performs 
joint MUDD based on one-shot LMMSE channel estimation. Intuitively, the performance of the one-shot CE- 

1 Intuitively, one may expect that the optimal CE-MUDD can be implemented with iterative CE-MUDD. However, it is unclear whether the 
solution of the iterative CE-MUDD converges to the global optimal solution. Therefore, the constrained capacity might be a loose bound for 
the performance of iterative CE-MUDD. 
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TABLE I 

Four receivers considered in this paper. 





channel estimation(CE) 


MUD 


feedback 


to CE 


to detector 


joint CE-MUDD 


suboptimal LMMSE 


optimal 


available 


available 


one-shot CE-MUDD 


suboptimal LMMSE 


optimal 




available 


optimum separated receiver 


suboptimal LMMSE 


optimal 






LMMSE receiver 


suboptimal LMMSE 


LMMSE 







MUDD corresponds to a performance bound for an iterative receiver obtained by eliminating the feedback from 
the per-user decoders to the channel estimator in iterative CE-MUDD. The optimum separated receiver performs 
separated optimal detection and decoding on the basis of one-shot LMMSE channel estimation. The performance 
of the optimum separated receiver corresponds to a performance bound for a non-iterative receiver obtained by 
eliminating the feedback from the per-user decoders to the channel estimator and to the detector. The LMMSE 
receiver is obtained by replacing the optimal detector in the optimum separated receiver by the LMMSE detector. 
To the best of our knowledge, no analytical results for joint CE-MUDD are obtained, except for non-iterative linear 
receivers ll34l and iterative CE-MUDD based on hard decision feedback ll35ll . The methodology developed in this 
paper is applicable to the analysis of iterative LMMSE-based CE-MUDD. See [36 1 for details. 

Our large-system analysis is based on the replica method i37ll - ll39ll . which is a powerful method for analyzing 
randomly-spread DS-CDMA systems HI, |[T8)-||20), gO) and MIMO systems I4ll-ll44l, The replica method is 
based on several non-rigorous procedures at present time. In this paper, we assume that results obtained by using 
these procedures are correct since their proof is beyond the scope of this paper. See |45|-|47| for recent progress 
with respect to the assumptions of the replica method. 

This paper is organized as follows: After summarizing the notation used in this paper, in Section [TT] we introduce 
a discrete-time model of MIMO DS-CDMA systems. In Section HID the joint CE-MUDD based on suboptimal 
LMMSE channel estimation is defined. In Section IIVI we define the three receivers based on one-shot LMMSE 
channel estimation. Section [V] presents the main results of this paper. In Section I VII we compare the performance 
of the joint CE-MUDD with that of the three receivers based on one-shot LMMSE channel estimation. Numerical 
simulations for finite-sized systems are also performed to demonstrate the usefulness of our large-system analysis. 
In Section IVLT1 we conclude this paper. The derivations of the main results are summarized in the appendices. 

A. Notation 

For a complex number z E C and a real number x € R, Re(z), Im(z), i, z* , \ogx, and lnx denote the real 
part, imaginary part, imaginary unit, complex conjugate, log 2 x, and log c x, respectively. |„4| stands for the number 
of elements of a set A. For a matrix A, A T , A H , Tr(A), and det(A) denote the transpose, conjugate transpose, 
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trace, and the determinant, respectively. In stands for the TV x TV identity matrix. 1 jy denotes the TV-dimensional 

(n) 

vector whose elements are all one. represents the TV-dimensional vector in which the nth element is one and 
the other elements are all zero. denotes the set of all positive definite n x n Hermitian matrices. ® denotes the 
Kronecker product operator between two matrices. S(-) represents the Dirac delta function, while Sij denotes the 
Kronecker delta. p(x) and p(y\x) stand for the probability density function (pdf) of a continuous random variable x 
and the conditional pdf of a continuous random variable y given x, respectively. We use the same symbol p(x) for 
the probability mass function (pmf) of a discrete random variable x. a ~ P(a) indicates that the distribution of a 
random variable a equals a distribution P(a). If the pdf p{a) of a exists, we use the notation a ~ p(a) to represent 
that a follows the distribution whose pdf is given by p(a). CJV(m, S) denotes a proper n-dimensional complex 
Gaussian distribution with mean m £ C N and a covariance matrix S e ll48l . The pdf of the n-dimensional 
complex Gaussian random vector x ~ CJV(m, S) is defined as p(x) — g n (x — m; S), given by 

9n(y; s) = — ^=e- yHj: ' lv . (l) 

D(A\\B) stands for the Kullback-Leibler divergence with the logarithm to base 2 between CW(0, A) and £/V(0, B). 
I(x;y\z) denotes the conditional mutual information with the logarithm to base 2 between a random variable x 
and a random variable y conditioned on a random variable z. 

The indices of chips, symbol periods, users, transmit antennas, and replicas are denoted by I, t, k, m, and a, 
respectively. In this paper, indices themselves have meanings, like the argument of distributions. Symbols with 
several superscripts or subscripts are used in this paper. We write sets of the symbols as follows: For a symbol af^ 
and a subset J of indices {j}, the set A\ j denotes a subset {a^j : for j G J} for fixed i and k. When J equals 
the set of all indices {j}, A\ j is also written as A\. The sets A\, A, and so on are defined in the same manner. The 
two sets A\ and Aj should not be confused with each other. The set J\{j} — {j' G J : j' 7^ j} denotes the set 
obtained by eliminating the element j from J . When J equals the set of all indices {j}, J\{j} is simply written 
as \{j}. As notational convenience for subsets of the natural numbers N, we use [a,b) = {i E N : a < i < b} 
for integers a and 6(> a), which is always used as subscripts or superscripts for discrete sets. The other sets 
[a, b), (a, b), and so on are defined in the same manner. As exceptions, the two sets {t' £ N : 1 < t' < i] and 
{<' e N : t < t' < T c } for coherence time T c are denoted by % and Ct, respectively, instead of [l,t] and [t, T c ]. 

We use symbols with tildes and hats to represent random variables for postulated channels and estimates of 
random variables, respectively. Underlined symbols are used to represent random variables for decoupled single-user 
channels. Note that there are several exceptions in the replica analyses presented in Appendix lei and Appendix IdI 

II. MIMO DS-CDMA Channel 

We consider the uplink of a synchronous iV-user frequency-flat fading MIMO DS-CDMA system with spreading 
factor L, in which each user and the receiver have M transmit antennas and TV receive antennas, respectively. A per- 
antenna spreading scheme is investigated in this paper: Different spreading sequences are used for different transmit 
antennas of each user. See l49ll for a generalization of spreading schemes. We assume block-fading channels with 
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coherence time T c , i.e., fading coefficients do not change during T c symbol periods, and they are independently 
sampled from a distribution at the beginning of the next coherent interval. 

The input symbol u t ^ m € C for the mth transmit antenna of user k in symbol period t is spread with a 
spreading sequence {si t t,k,m : I = 1, ■ ■ ■ , L}. The chip-sampled received vectors {y l t £ C N : I = l,...,L} in 
symbol periods t = 1 , . . . , T c are given by 

^ K M 

Vl.t = ~~rf J]] hk, m Sl.t,k,mUt.k.m + ™Z.t- (2) 

V L fe=lm =l 

In (01, ni.t ~ CAf(0,NolN) represents the AWGN vector with variance Nq. Furthermore, hk. m € denotes the 
channel vector between transmit antenna m of the fcth user and the receiver. Note that the channel vectors are fixed 
during T c symbol periods. 

The assumption of frequency-flat fading channels might be an unrealistic assumption since practical MIMO DS- 
CDMA systems commonly operate over frequency-selective fading channels. For the sake of simplicity, however, 
we consider frequency-flat fading channels. An extension to frequency-selective fading channels is possible by 
considering the assumption of independent spreading sequences across different resolvable paths ||4), ll34l . For 
details, see ll50l . 

The receiver does not have CSI in advance, which is information about all realizations of the channel vectors 
{hk : m : for all fc, m}, while it knows all spreading sequences, the variance A*o of the AWGN, and the statistical 
properties of all channel vectors and input symbols. In order for the receiver to estimate the channel vectors, consider 
that the first r symbol periods T T = {1, . . . , t} in each coherent interval are assigned to a training phase, and that 
the remaining r' = T c — r symbol periods C r +i = {t + 1, . . . ,T C } are assigned to a communication phase. The 
length of the training phase r is a design parameter, which will be optimized on the basis of large-system results. 
User k transmits pilot symbols {x t: k,m G C} known to the receiver from the mth transmit antenna in the training 
phase t € T T , and subsequently sends data symbols {&t,k,m <= in the communication phase t E C T +i- Therefore, 
the input symbol u t: k,m is given by 

!X t ,k,m for t e Tt, 
(3) 
bt,k, m for t E C T +i- 

Throughout this paper, we assume that the input symbols {u t .k. m } are mutually independent for all t, k, m, 
and that each u t .k. m is a zero-mean random variable satisfying \u t: k,m\ 2 = P/M. In numerical results, unbiased 
QPSK input symbols with \u t: k,m\ 2 — P/M are used. Furthermore, it is straightforward to extend the results to 
the unequal power case. Next, we assume that the channel vectors {hk : m} are mutually independent for all fc, 
m, and that each hk, m h as independent and identically distributed (i.i.d.) circularly symmetric complex Gaussian 
(CSCG) elements with unit variance. Finally, we assume that the spreading sequences {sj t k TO : for all I, t, k, m} 
are i.i.d. for all /, t, k, m, and that each si^k,™ is a CSCG random variable with unit variance. We have made the 
CSCG assumption of each chip for the sake of simplicity in analysis. We believe that the main results presented in 
this paper hold for a general distribution of si.t : k,m with zero mean and finite moments, as shown numerically in 
Section ED 
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We shall present several sets used in this paper: The set y t = {y l f 6 C N : I = 1, . . . , L} denotes the received 
vectors in symbol period t. The set St — {si t t,k,m '■ f° r ai l U k, m} represents the spreading sequences in symbol 
period t. The sets Bt : k — {Pt,k,m '■ f° r a ll m } an d %t,k = {xt.k.m ■ for all m} represent the data and pilot 
symbols for user k in symbol period t, respectively. All data and pilot symbols in symbol period t are denoted 
by Bt — {Bt : k '■ for all k} and X t — {X t .k '■ for all k}. The set Ut = {ut,k,m '■ f° r a U K m} represents the 
input symbols in symbol period t. The channel vectors for user k are denoted by Hk = {hk. m '■ for all m}. The 
set Xt — {y t , S t , U t } denotes the information about the received vectors y t , the spreading sequences S t , and 
the input symbols U t in symbol period t. The set I t = {y t , S t } is obtained by eliminating the input symbols 
Ut from I t . The training phase in stage t of successive decoding, introduced in the next section, is denoted by 
7t_i = {1, . . . , t — 1}, while the following stages are denoted by Ct+i = {t + 1, . . . , T c }. The set C T +i is also used 
to represent the communication phase. Note that the same index t as for symbol periods is used for the indices of 
stages in successive decoding. A list for several sets used in this paper is summarized in Appendix iGl 

III. JOINT CE-MUDD 
A. Joint CE-MUDD Based on LMMSE Channel Estimation 

In order to define joint CE-MUDD based on LMMSE channel estimation, we shall derive a lower bound of the 
constrained capacity based on LMMSE channel estimation. The definition of joint CE-MUDD considered in this 
paper will be presented in the next subsection. 

We start by the constrained capacity of the MIMO DS-CDMA channel with no CSI. Let y Tr — {y t : t e %} 
and yc T+1 = {yt ■ t € C T +i} denote the received vectors in the training and communication phases, respectively. 
Furthermore, we write all data symbols B and all pilot symbols X as B = {Bt : for all t 6 C r +i} and X = {X t : 
for all t £ %-}■ The constrained capacity for the no-CSI case is given by the mutual information per chip between 
all data symbols B and {yc T+1 , 3V T > £7 X} known to the receiver IfSTI , with S = {S t : for all t] denoting all 
spreading sequences, 

C op t = ^rI(B;y Cr+1 ,yT T ,S,X). (4) 
Using the chain rule for mutual information, we obtain 

c pt = T jf-j{^ ycr +1 \yr, ,s,x) + -^-m y TT , s 7 x) 

=^rmy Cr+1 \yT T ,S,X), (5) 

where the last equality holds since the data symbols B are independent of {yr T ,S, X}. For notational convenience, 
we hereafter omit conditioning with respect to 3V T ^ S> ar, d 

It is well known that the constrained capacity can be achieved by successive decoding. In order to obtain a 
lower bound for the constrained capacity, a successive decoding strategy has been considered in ||52l , ||53l . In 
this strategy, channel estimation and MUD in successive decoding are replaced by suboptimal ones that allow 
us to evaluate the performance analytically. We present joint CE-MUDD based on LMMSE channel estimation, 
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following the successive decoding strategy. In successive decoding with t' = T c — t stages, the data symbols {Bt} 
are decoded in the order t = r + 1, . . . , T c . Stage t G C r +i consists of K substages, in which the data symbols 
{Bt,k} in symbol period t are decoded in the order k = 1, . . . ,K. We focus on substage k within stage t. Let 
£?( T)t ) = {Bt> : t' = t + 1, . . . , t — 1} and 3fc T+ i\{t} = {3-V : t' E C T +i,t' ^ t} denote the data symbols decoded 
successfully in the preceding stages and the received vectors in the communication phase except for y t , respectively. 
The independency of {Bt} for all t implies that Bt is independent of the received vectors {3V} in different symbol 
periods t' ^ t. By using the chain rule for mutual information repeatedly, © yields 

my Cr+1 )= J2 HBaycr+AB^t)) 

t = T + l 

T c 

= [J(ft;yc r+ i\{t}iB(r,t))+J(B*;y*iB(r > t),y Cr+1 \{ t })] 

t—T+l 

= J2 I(Bf,y t \B (T , t) ,y eT+1 \ { t}), (6) 

t—T+l 

where the last equality holds since Bt is independent of 3^e T+ i\{t}- If there were dependencies between the data 
symbols in different symbol periods, the equality would not hold. Applying the chain rule for mutual information 
to gives 

, Tc K 

Copt = jwr ^2 ^2 c ° P k> ^ 

C {=T+1 fc=l 

with 

C°S = I(Bt,k\yt\Bt,[i,k),B (Ttt) ,y CT+1 \ { t}), (8) 

where St,[i,fe) = {Bt,k' '■ k' = 1, . . . , k — 1} denotes the data symbols decoded in the preceding substages. 

We focus on each mutual information (HJ. In estimating Bt,k> the data symbols Br T! t) decoded in the preceding 
stages are available for channel estimation, while the data symbols BfUk) decoded in the preceding substages are 
used in MUD. The optimal receiver achieving the mutual information ^ consists of the optimal channel estimator, 
the optimal detector, and per-user decoders. The optimal channel estimator uses the information — {It' '■ f E 

7t-i} in the preceding stages and the information Ic t+1 — {If ■ f E C t +i} in the following stages to construct the 
posterior pdf p(H\I%_ 1 ,Ic t +i) °f a H channel vectors H = {Hk ■ for all k}, which is sent to the optimal detector. 
Note that p(ri\Ij- t _ 1 ,Ic t+1 ) is non-Gaussian, since Ic t+1 — {3^t',«5 t / : t' E C t +i} is an incomplete data set, i.e, it 
does not contain the data symbols {B t '}- Consequently, the optimal channel estimator is nonlinear. In the optimal 
detector, the posterior pdf p(B t: k\yt,S t , B t ti^\,X'r t _ 1 ,Xc t+1 ) is constructed and fed to the corresponding decoder, 
by utilizing the information about the received vector y t , the spreading sequences St, the data symbols Btuk) in 
the preceding substages, and the posterior pdf p(H\I%-i ,%C t +i) provided by the channel estimator. 

It is difficult to obtain an analytical expression for the constrained capacity @, since the optimal channel estimator 
is nonlinear. In order to obtain a lower bound based on LMMSE channel estimation, we consider a lower bound 
for ®, 

0°$ > /(Bt,fc;yt|Bt,[l,fc),B(r, t ),y(r,t)), (9) 
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with y^r.t) = {yv '■ t' = t + 1, . . . ,t — 1} denoting the received vectors in the preceding stages. In the derivation 
of (O, we have used the assumption of independent data symbols and the fact that conditioning does not increase 
(differential) entropy. Recall that conditioning with respect to JV T , S, and X in the mutual information is omitted. 
Substituting © to © yields the lower bound C op t > C, with 

T c K 

c = Tt J2 ^HBt^ytiBt^iT^St), do) 

C t=T+l fc=l 

where we have re-written the conditioning random variables as {B t mm, 1-%-n St}, using the fact that the lower 
bound (O is independent of the spreading sequences Sc t+1 = {St> : t' £ Ct+i} in the following stages anymore. 

The receiver achieving the lower bound ( TTOb is obtained by replacing the optimal channel estimator by a channel 
estimator that sends the posterior pdf p(H\Ij- tl ), instead of p(fl\Irj~ t _ x ,Ic t+1 )■ It is straightforward to find that this 
channel estimator is equivalent to the LMMSE channel estimator, as shown in the following remark. Thus, we refer 
to the lower bound ( fT0T > as the spectral efficiency of the joint CE-MUDD based on LMMSE channel estimation. 

Remark 1. We shall show that the channel estimator corresponding to the lower bound (0 is the LMMSE channel 
estimator. We first confirm that the channel estimator is linear. The information Tj- t l in the training phase is a 
complete data set for estimating the channel vectors %, i.e., Xf t l contains the received vectors, the spreading 
sequences, and the input symbols in each symbol period. The Gaussian assumption of the channel vectors ~ 
CA/"(0, In) implies that the posterior pdf p('H\X%_ 1 ) is a proper complex Gaussian pdf whose mean is given by 
a linear transform of the received vectors. Thus, the channel estimator based on the posterior mean estimator is 
linear. 

We next show that the channel estimator is equal to the LMMSE channel estimator. See Appendix\A\for derivations 
of the LMMSE estimator. Following H32V , [33], we replace the term hk im si,t' ,k,mbt> ,k,m in (0 for the following 
stages t' = t + 1, . . . ,T C by a proper complex Gaussian random vector whose mean and covariance are given 
by hk, m SLt' ,k,mM[bt' ,k,m] and \ 

Sl,t'k,m I ^[^k,mbt' ,k,m (hk, m bt',k,m) H }, respectively. However, the mean is equal 
to zero, because of E[bf t k,m] — 0. As a result, the received vectors in the following stages is independent of 
the channel vectors anymore. In other words, the LMMSE channel estimator does not utilize the information 
yCt+i — {yf '■ t' € Ct+i} in the following stages, since the LMMSE channel estimator postulates that the received 
vector Hi t , in (0 is independent of the channel vectors for t 1 G Ct+\. Therefore, the lower bound (0 corresponds 
to the spectral efficiency of the receiver based on the LMMSE channel estimation. 

B. Joint CE-MUDD Based on Suboptimal LMMSE Channel Estimation 

It is still hard to evaluate the spectral efficiency ( TTOb of the joint CE-MUDD based on LMMSE channel estimation, 
since the posterior pdf p(H\lr t -i) sent by the LMMSE channel estimator is not factorized into the product of 
the marginal posterior pdfs rifc=i PiHkl^hi-i)- Instead, we consider a suboptimal receiver in which the LMMSE 
channel estimator is replaced by a suboptimal LMMSE channel estimator that sends the product J^fcLi P^Hh^Tt-i)- 
The spectral efficiency for the suboptimal receiver provides a lower bound for the spectral efficiency ( TTOb based 
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Fig. 1. Joint CE-MUDD based on suboptimal LMMSE channel estimation. 



on LMMSE channel estimation. Since the LMMSE detector uses no covariances of the channel estimation errors 
between different users |34|, as noted in Section HV-DI no performance loss due to this replacement occurs for 
the LMMSE receiver. On the other hand, the optimal detector utilizes all information about the channel vectors, 
i.e., the joint posterior pdf p('H\Ij- t _ 1 ). It is unclear whether the performance loss caused by using the suboptimal 
LMMSE channel estimator is negligible in the large-system limit for the optimal detector. 

Let us focus on substage k within stage t. The suboptimal receiver consists of a suboptimal LMMSE channel 
estimator, the optimal detector, and the per-user decoders (See Fig. [TJ. The channel estimator uses the pilot symbols 
and the data symbols decoded in the preceding stages to estimate the channel vectors. More precisely, the channel 
estimator constructs the posterior pdf p(7ik\^T t -i) °f me channel vectors Hk for each user by utilizing the known 
information 2r t _ a = {y t >, St', U t > : t' e 7t-i}, 

I nl^pjyt* \n, sr,ur) P (H)dH\ {k } 
I n*~=i p(y« \n,s t , , u v ) P (H)dH 

with H\{k} = {Hk 1 '■ for all k! 7^ k} denoting the channel vectors except for Hk- In ( fTTT i. the pdf ptyt' \H,Sf Mt 1 ) 
represents the MIMO DS-CDMA channel (O in symbol period t'. Then, the marginal posterior pdfs {p(Hk\Z%-i) '■ 
for all k} are sent towards the optimal detector. Sending p(Hk \^T t -i) is equivalent to feeding the LMMSE estimates 
h^km = / hk,m.p{'Hk\lTt-i)d'Hk and the covariances of the estimation errors Ah^^ — hk t m — h^m f° r a ^ TO ' 
since the posterior pdf p{Hk\Ir t -i) i s CSCG. 

The optimal detector uses the information about the received vectors y t , the data symbols B t n^) decoded in the 
preceding substages, the spreading sequences S t , and the posterior pdfs {p(Hk\^ht-i)} provided by the channel 
estimator to detect the data symbols Bt.k- The term "optimal detector" indicates that the detector is optimal among 
all detectors that regard the product Y[^=iP(^-k\^Tt-i) °f me marginal posterior pdfs as the true joint posterior 
pdf. The optimal detector constructs a posterior pdf p(Bt l fe|3^,Bt,[i,fc),<St,27^_ 1 ) and subsequently forwards it to 
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the corresponding per-user decoder. In the posterior pdf, the data symbols B^ k = {bt.k.m G C : for all m} ~ 
rim=i P{bt,k,m) denotes the data symbols in the MIMO DS-CDMA channel postulated by the optimal detector 

Vl.t = —ff ^ ^ ^fe,m s !,t,fc,nA,fc,m + (12) 



7n— 1 



with hi t t ~ CJV(0, NoIn). In (fT2l . = {hfc >m € : for all m} denotes random vectors representing the 
information about W k provided by the channel estimator. The joint posterior pdf of H = {T~L k ■ for all k} satisfies 



A" 



P (H\l Tt _ 1 ) = l[p(n k =H k \lT t - 1 ). (13) 

fc=i 

Note that p(W\lj- t _ 1 ) 7^ p(H\l%_ 1 ) since p(H\Ij- t _ 1 ) is not decomposed into the product of the marginal pdfs. 
The posterior pdf p(B t ,k\yt,^t,{i,k)^S t ,I%-i) is an abbreviation of p(B t ,k\yt = yt,B~t,[i,k) = Bt,[i,k),St,ZT t -i)< 
with y t — {yit G : for all /} and Bt,[i,fc) = {^t,fc' : k' = 1, . . . ,k — 1} denoting the received vectors in ([T2i i 
and the postulated data symbols in the preceding substages, respectively, given by 

p{B t ,k\y t ,B tAl>k) ,S u lr t ^) = - r - g - ~ g g , (14) 

withB t , (fc , K] = {B tlfc / : fc' = fc + l,...,tf} and B t , [fe>K] = {B tlk > : k' = k, . . . , K }. In O, pC^St,^,:^) is 
given by 

p(yt\BuS u l rt -i) = J pmUiSuBMUpr^dU, (15) 

where p(yt\W, St, Bt) represents the MIMO DS-CDMA channel (TTZt postulated by the optimal detector in symbol 
period t. The marginal posterior pdf (fT~4T > would reduce to the true one p{Bt t k\yti^t,[i,k)iSt,I%_- l ) if tne channel 
estimator sent the joint posterior pdf p(H\lj- t -i)- 

The spectral efficiency Cjoint of the joint CE-MUDD based on the suboptimal LMMSE channel estimation is 
given by, 

K 



Cjoint = 7^- i>2 Ctk > (16) 

c t=r+l k=l 

with 

C tlfc = J(iB t , fc ;B tlfc |B ti[lifc) ,Xr t _ 1) 5t). (17) 

The conditional mutual information (TTTb is characterized by the equivalent channel between user k and the corre- 
sponding decoder, given by 

p(B t ,k\Bt,k,S t ,[i,k),lT t -i,S t ) = J p(Bt,k\yt,B tt[ltk) ,St,lTt-i)p(yt\B t ,[i,k],St,lTt-i)dyt, (18) 

where 

p(yt\S t ,[i,k},S t ,l Tt _ 1 ) = J piyt^SuBMBt^KMnilT^dHdBt^K], (19) 

with B tj[ljk] = {Bt, fe , : kl = l,...,k} and B t , (fci;K] - {B t , k > : fc' = fc + l,...,K}. In tOLl, p(y t |W,5t,Bt) 
represents the MIMO DS-CDMA channel (fJJ in symbol period t. Note that Bt.k in (Tf7T i plays the role of random 
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Fig. 2. One-shot CE-MUDD based on suboptimal LMMSE channel estimation. 



variables representing the information about Bt,k provided to the per-user decoder, although we have introduced 
Bt t k as random variables representing the data symbols postulated by the optimal detector in (fT2l >. 

The spectral efficiency (fTSI l of the joint CE-MUDD based on the suboptimal LMMSE channel estimation is 
a lower bound for the spectral efficiency ( TTOb based on the LMMSE channel estimation. We hereafter focus on 
the lower bound ( TT~6T >. Thus, the joint CE-MUDD based on the suboptimal LMMSE channel estimation is simply 
referred to as the joint CE-MUDD. The spectral efficiency (fT~6b of the joint CE-MUDD should not be confused 
with the spectral efficiency (TTOb based on the LMMSE channel estimation or with the constraint capacity ©. 

IV. Receivers Based on One-Shot Channel Estimation 

A. One-Shot Channel Estimation 

For comparison with the joint CE-MUDD, we consider three receivers based on one-shot channel estimation, 
in which the decoded data symbols are not used to refine the channel estimates. A first receiver performs joint 
MUDD based on one-shot LMMSE channel estimation, called one-shot CE-MUDD. This receiver is obtained by 
eliminating the feedback from the per-user decoders to the suboptimal LMMSE channel estimator in Fig. Q] A 
second receiver performs separated decoding based on one-shot LMMSE channel estimation, called the optimum 
separated receiver. The receiver is obtained by eliminating the feedback from the per-user decoders to the channel 
estimator and to the optimal detector. The last receiver is an LMMSE receiver in which the detector in the optimum 
separated receiver is replaced by an LMMSE detector. The four receivers considered in this paper are listed in 
Table Q] 

B. One-Shot CE-MUDD 

We define the one-shot CE-MUDD based on the suboptimal LMMSE channel estimation (See Fig. |2). In stage t 
of the joint CE-MUDD, the channel estimator has used the information = St', Uy ■ f € 71— 1} 
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Fig. 3. Optimum separated receiver based on suboptimal LMMSE channel estimation. 



about the received vectors {3^t'}, the spreading sequences {Sf}, and the transmitted symbols Uf in symbol periods 
t' € 7t-i. On the other hand, the LMMSE channel estimation in the one-shot CE-MUDD cannot utilize the data 
symbols \B t ' '■ f = r + 1, . . . ,t — 1} decoded in the preceding stages. This restriction is equivalent to assuming 
that the information I( T ,t) = {3V, <->t') Mtf ■ t' = t + 1, . . . ,t — 1} in the preceding stages is not used for channel 
estimation, as discussed in Remark Q] In other words, the information Ij- t = {3V, Sf, X t i : for all t' € %-} in the 
training phase is utilized for channel estimation. Thus, the suboptimal LMMSE channel estimator in the one-shot 
CE-MUDD provides the marginal posterior pdfs {p(Hk\Zr T )} to the optimal detector. Note that sending the joint 
posterior pdf p(H\Ij- T ) is of course optimal. Strictly speaking, the one-shot CE-MUDD should be referred to as the 
one-shot CE-MUDD based on the suboptimal LMMSE channel estimation. However, we simply call it the one-shot 
CE-MUDD, since the true LMMSE channel estimator is not analyzed in this paper. 

The marginal posterior pdf p(Hk\ZT T ) i s e q ua l to the one constructed in the first stage of the joint CE-MUDD. 
Thus, the spectral efficiency of the one-shot CE-MUDD is given by 

C one = ( 1 - L- 1 V Cr+l,*, (20) 

^ ±cJ fc=l 

where the mutual information C T +i,k is defined as ( fTTb . 

C. Optimum Separated Receiver 

We define the optimum separate receiver based on the suboptimal LMMSE channel estimation (See Fig. [3]). 
In substage k of the joint CE-MUDD, the optimal detector has used the data symbols Bt \i k) in the preceding 
substages to mitigate MAI. In the optimum separated receiver, on the other hand, the information B t rx,fc) is n °t 
utilized for MUD. The posterior pdf p(Bt,k\yt>St,Ij- r ) is constructed and sent to the corresponding per-user 
decoder. The posterior pdf is equal to the one constructed in the first substage of the joint CE-MUDD. Thus, the 
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spectral efficiency of the optimum separated receiver is given by 

K ( T 



Cscp — -jr yl - — j Ct+i,i, (21) 
where the mutual information C T +i,i is defined as (T% . 

D. LMMSE Receiver 

The LMMSE receiver is obtained by replacing the optimal detector in the optimum separated receiver by the 
LMMSE detector. We first derive the LMMSE estimator of the data symbols Bt,k> after which we define the 
information about Bt,k provided from the LMMSE detector to the corresponding decoder. The LMMSE estimator 
of Bt,k would be obtained by regarding Bt,k as CSCG random variables Bfi = {bri m € C : for all m} with 
E^S.m^S.m')*] = {P/M)S m , m * if the receiver had perfect CSI (20). However, the posterior mean of B^ is 
nonlinear in the received vectors since the channel model (PL?) includes multiplicative noise due to the influence 
of channel estimation errors. We approximate the channel model (fT2l by a channel model without multiplicative 
noise. We extract the term including channel estimation errors from the first term on the right-hand side of Sl2\ . and 
subsequently approximate the extracted one by an AWGN term with the same covariance. Therefore, the MIMO 
DS-CDMA channel in symbol period t postulated by the LMMSE detector is given as 

| K M 



' k—1 m—1 



with hf^ ~ CN{0,NqIn)- In (l22l . h km denotes the LMMSE channel estimates h^ m = J hfe >m p(?{fc|2 T )d?{fc. 
The vectors {wPt, G C^} are independent CSCG random vectors with the covariance matrix (P/M)S/- iOT for all 
t, k, and m, in which Sfe jTO denotes the covariance matrix of the channel estimation errors Ah k T m = hk, m — h k m , 
i.e., S fe , m = E[Ah^ m (Ah^J H \lT T }. Note that E[AAi^ m 6 t , fc , m (A^J m 6 t , fc) , m ) H |I T ] - O for fc ^ fc', since the 
data symbols {6t,fe,m} are independent unbiased random variables. Thus, the same detector would be obtained even 
if the true LMMSE channel estimator was used instead of the suboptimal one. 

Let j)j = ft e ^ N ■ f° r au> ^} denote the received vectors postulated by the LMMSE detector in symbol 
period t. Furthermore, we write the postulated data symbols in symbol period t and the postulated data symbols 
except for Bt.k as B^ = {B[ L k : for all k} and B^X^ — {Bf^, : for all k' ^ k}, respectively. The linear estimator 
of Bt,k is given by the mean of Bf£ with respect to the posterior pdf 

/ p{y^ ] = y t \b[ l) , s t , x Tt )p(b[ l) )dB ( t L) 



with 



K M 



k=l m—1 

;(L)|r(L) c r„~.( L ) 



where p{y\ >\B\ f',S u {w ( t ^ m }, {h k , m }) represents the MIMO DS-CDMA channel (22) in the t{> r)th symbol 
period postulated by the LMMSE detector. 
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The LMMSE detector provides the posterior pdf d23l l to the decoder of the fcth user. The transfer of ( |23l l is 
equivalent to feeding the LMMSE estimate of Bt,k an d the covariance matrix of its estimation errors. We remark 
that our LMMSE receiver is equivalent to the one proposed by Evans and Tse B4l for N = M = 1. 

The spectral efficiency of the LMMSE receiver is given by 

K ( \ K 

C ^ = T\ 1 -TJ ^ I{ - Bt ^ Pr> ^' (25) 

In dZBI l. the mutual information I(Bt,k] Bfi }\Xr,St) is characterized by the equivalent channel between user k and 
the corresponding decoder for the LMMSE receiver, 

p(B^\B t , k ,S t ,lT T ) = J p(B^\y^ = yt,St,lr T )p(yt\Bt,k,S t ,l TT )dy t , (26) 

where p(yt\Bt.k, St,%T T ) i s defined in the same manner as in dl9t . 

V. Main Results 

A. Large-System Analysis 

In order to evaluate the spectral efficiencies of the four receivers listed in Table [I] we consider the large-system 
limit, in which the number of users K and the spreading factor L tend to infinity while the system load j3 = K/L and 
the number of transmit and receive antennas are kept constant. More precisely, we evaluate the spectral efficiencies 
by analyzing asymptotic properties of the equivalent channel between a finite number of users and their decoders. 
We write a finite number of users as JC, which is a finite subset of {1, . . . , K}, and consider the large-system limit 
liiriR-,L^co in which K and L tend to infinity with f3 and JC fixed. 

We focus on substage k within stage t of the joint CE-MUDD. A finite subset of users JC is chosen from 
the set {k, . . . , K} of users decoded in the current and following substages, i.e., JC fl {1, . . . , k — 1} = 0. Let 
Bt,K = {Bt,k ■ k € JC} and Btjc = {&t,k '■ k G /C} denote the data symbols and the postulated data symbols for a 
finite subset JC of users in symbol period t, respectively. The equivalent channel p(&t,ic\Bt,lc> Si,[i,fc)>^T t _i)<St>%) 
between the users in JC and their decoders, defined in the same manner as in ([T8T l. is expected to be self-averaging 
with respect to the spreading sequences St in the large-system limit: The equivalent channel converges to the one 
averaged over the spreading sequences for almost all realizations of the data symbols B{,/e> the data symbols B t mm 
decoded in the preceding substages, the known information in symbol periods t' G Tt-x, and the channel 

vectors T~L in the large-system limit. The self-averaging property has been proved for linear receivers lfl5l . ll34l 
and for the constrained capacity of CDMA systems with perfect CSI at the receiver [47|, [54]. However, it is an 
open challenging problem to show whether the self-averaging property holds for general receivers. Therefore, we 
postulate the self-averaging property. 

Assumption 1. The conditional distribution of Btjz. given Bt.tc, Bt,[x,k)> ^T t -i> St, and % converges in law to a 
conditional distribution that is independent of St in tlie large-system limit. 
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Another crucial assumption is replica symmetry (RS) |fl8l . See Appendix IO for a formal definition of the RS 
assumption. In order to present an intuitive understanding of the RS assumption, let us consider an iterative MUD 
algorithm based on belief propagation (BP) ll55ll . Roughly speaking, the algorithm iteratively calculates a local 
minimum solution of an object function, called free energy, in the large-system limit. The global minimum solution 
of the free energy corresponds to the optimal one. The RS assumption implies that the free energy has the unique 
stable solution or at most two stable solutions. If the free energy has many stable solutions, replica-symmetry 
breaking (RSB) should be assumed ||39l . 

Two necessary conditions for checking the RS assumption are known: de Almeida-Thouless (AT) stability l56l and 
the non-negative-entropy condition |39|. The AT condition is a necessary condition for the stability of RS solutions. 
The non-negative-entropy condition is a necessary condition under which the entropy for the posterior distribution 
of replicated random variables is non-negative when the random variables are discrete. Note that the non-negative- 
entropy condition is not defined for the no-CSI case, since the channel vectors are not discrete. See Appendix [D] 
for details. The RS solution for the individually-optimal (IO) receiver with perfect CSI, which corresponds to 
the optimum separated receiver in this paper, has been proved to satisfy the AT stability condition lT8ll . Il57ll . 
Furthermore, that solution satisfies the non-negative-entropy condition derived in ifTSl Equation (69)] for CDMA 
systems with perfect Csj^. These results may imply that the RS assumption is valid for the IO receiver. In fact, 
several rigorous studies have shown that this statement is partially correct: Nishimori |57| has used a gauge theory 
to show that the free energy has no complicated structure for the IO receiver. See Appendix [C] for the precise 
statement. Korada and Montanari [47] have proved that the RS assumption is correct if the free energy under the 
RS assumption has the unique stable solution. Thus, we postulate the RS assumption in this paper. 

We show under these assumptions that the randomly-spread MIMO DS-CDMA channel with no CSI is decoupled 
into a bank of single-user single-input multiple-output (SIMO) channels with no CSI 

Mt,fc,m = h k^U t ,k, m + « t ,fc, m , (27) 

where n t k ~ CAf(0, erf In) denotes AWGN with variance of. The equivalent channel between the users in JC and 
their decoders looks like a bundle of the single-user SIMO channels ( |27] i with the original channel estimator dTTb - 
Furthermore, MAI to the users in JC converges towards MAI from the single-user SIMO channels (|27| i with no 
CSI in the large-system limit. Our result is an extension of the decoupling results for randomly-spread MIMO 
DS-CDMA channels with perfect CSI at the receiver ifTTl . Il20i to the no-CSI case at the receiver. The difference 
from the previous decoupling results appears in the receiver structure for the SIMO channel d27l i, which depends 
on the receiver structure of the original MIMO DS-CDMA systems. 

The equivalent channel between the users in JC and the corresponding decoders is characterized by two receivers 
for the decoupled SIMO channels d27l ). i.e., a receiver for the users in JC and another receiver for the users who 
interfere to the users in JC. In the former receiver joint decoding of all data streams for each user is performed. The 

2 It is straightforward to prove that the non-negative-entropy condition 1 18. Equation (69)] is satisfied for the IO receiver, although the proof 
was not presented in 1181 . 



April 21, 2011 



DRAFT 



IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. , NO. , 2010 



17 



original 
channel estimator 



received vectors of the MIMO 
DS-CDMA channel 



information about channel vectors 

information about b, , 



optimal detector 



information about b uU 



received vectors of the SIMO channels 
in symbol period t 



decoder k 



Fig. 4. Receiver for the users in K. 



receiver consists of the original LMMSE channel estimator providing the marginal posterior pdfs (fTTT i. an optimal 
detector, and the decoder of user k (See Fig. [4j. The latter receiver performs per-stream decoding, and is to be used 
in the decoupled expression for quantifying the strength of MAI from outside JC. The receiver consists of another 
LMMSE channel estimator, the optimal detector, and the per-stream decoders (See Fig. |5j. Note that the information 
Bt,[i,k) m the preceding substages does not appear explicitly in the two receivers. It affects the decoupling results 
via the noise variance of. 

Definition 1 (Receiver for the users in /C). In symbol period f(> r), the optimal detector uses the information 
about the received vectors y_ t k — {y f , m : for all m} of the single-user SIMO channel \27\ in symbol period t 
and about the posterior pdf ( 1771 ) provided by the original channel estimator to construct the posterior pdf 

Um=lP(y t . k . m \ h k,rn,b t ,k, m )p('Hk\lT t - 1 )p(Bt,k) 



p(B t , k \y.,,i Tt _ 1 ) = 



(28) 



jnm=lP(M 4ifcim l^,m,^,fe,mM^fel :r rt- 1 M'Bt,fe)dK/ 5 dB tlfe ' 
where p{y t k m \hk,m,b tt k,m) represents the single-user SIMO channel H27j . Subsequently, the optimal detector 
sends the posterior pdf ( |2<SD towards the decoder of user k. 

Definition 2 (Receiver for the interfering users). The LMMSE channel estimator constructs the posterior pdf 
p(hk, m \Z.T t - 1 .k.m) °f hk,m by utilizing the information I Tt l k m = {T_ t , k m : t' £ 7t-i} in the preceding stages, 
in which k m — {ut' ,k,mi U^, k m \ denotes the information about the input symbol Ut' ,k,m 

and the received 

vector y v km in the single-user SIMO channel ( 12 7t . Subsequently, the posterior pdf p(hk, m \X.T t _i km) " sent 
towards the optimal detector. 

In symbol period t{> r), the optimal detector utilizes the information about the received vector m and 
p(hk,m\2i.T -l k m) provided by the channel estimator for the SIMO channel \27\ , and constructs the posterior pdf 



p(b t ,k,m\y 



.t,k.m'—7i-i,k,m) 



J p(bt,k, m ,hk, m \y tkm ,l Tt _ uk:m )dhk, m , 



(29) 
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Fig. 5. Receiver for the users interfering to the users in K.. 



with 



P(V t k m \ h Krn,hXm)p(Kk,m)p(hk, m \ZT t - U k,m) 



p(bt,k,mt hk m \y , ,Z.%_ 1 k.m) ~ T 7 rT~ 7 \~~77 \~77. Fr VjI lu " 

' ' i P\]i tk J l h k, m M,k, m )p{hM, m )p{h ktm \l Tt _ u 



Subsequently, the optimal detector sends the posterior pdf l\29i towards the corresponding per-stream decoder. 

We summarize several symbols used for claiming the main results. The LMMSE estimate of hk, m f° r the 
single-user SIMO channel d27l i is denoted by 

tkk,m = J hk, m p{h k! J\X Tt _ lXm )dh k ,m- (31) 

When a\, — rj 2 . for t' E the covariance matrix of the LMMSE estimation error Ahj}^ — h k _ m — tkkm i s 

given by £ 2 (of r ,t - 1)1 n Gfl, with 

where (t — 1) corresponds to the length of the training phase in stage t. The estimate h k m is a CSCG random 
vector with the covariance matrix (1 — £ 2 (of r ,i — 1))Jat, since hkm an( ^ ^t^km 3X0 uncorrelated. Furthermore, 
we write the posterior mean of bt,k,m given y t m and fe m as 

(h,k,m) = J h,k,mP(bt,k,m, hk,m\y_ t k m ,Z.T t -i,k,m) db t,k, m dh kim . (33) 

Finally, let C_ t (al r (t — l),cr 2 ) denote the conditional mutual information between the data symbol &t.i,i and the 
received vector U tll for the SIMO channel (|27T > given the information If tl i \, i.e., 

C t {al{t- i),af) =/(6t,i,i;y til)1 |I rt _ 1>1 , 1 ), (34) 

with er 2 , = cr 2 ,.(i — 1) for t' E Tt-x- The conditional mutual information corresponds to the spectral efficiency of 
the receiver defined in Definition [2] for the single-user SIMO channel d271 l. 
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Remark 2. We shall discuss the relationship between the RSB assumption and the decoupling result. Under the RSB 
assumption, the equivalent channel would still be decoupled into single-user channels. However, the noise in the 
SIMO channel A27\l would be non-Gaussian. This implies that if the RS assumption is not correct a naive intuition 
is wrong: MAI should converge to Gaussian noise due to the central limit theorem. In fact, the RSB assumption 
should be considered if the jointly-optimal (JO) receiver is used [18], [59]. 

B. Optimum Separated Receiver 

It is the main part in our derivation to analyze the spectral efficiency ( BTT i of the optimum separated receiver 
in the large-system limit. The spectral efficiencies of the one-shot CE-MUDD and the joint CE-MUDD can be 
straightforwardly derived from the result for the optimum separated receiver. Thus, we first present an analytical 
expression for the spectral efficiency of the optimum separated receiver. Analytical formulas for the spectral 
efficiencies of the one-shot CE-MUDD and the joint CE-MUDD will be presented in Section IV^Cl and Section |V-D1 

Let B tk — {b t km e C : for all m} ~ p(Bt,k\y_ t fc ,^T T !h defined by (f28j), denote random variables representing 
the information about Bt,k, provided by the optimal detector. The equivalent channel between user k and the 
corresponding decoder is given by 

„ M 

p(B t JB^,H k ,lr T ) = / p(Bt,k=B t ,k\y t , k M Y[ {p(y t , k J h K m ,b t .k,n)dy t k m } . (35) 

m=l 

Proposition 1. Assume that Assumption\l\and the RS assumption hold. Then, 

lim p(B t . K \B tx ,lr T ,S t ,n) = TT p(B t k = B t , k \Bt,k,n k M in law, (36) 

K,L— >oo ' 



where of = cr^ for t G C T +i is given by a solution to the fixed-point equation 



(37) 



„ 7- 

with h k l and £ 2 = £ 2 (of r (r) , r) give by 071 ) and 021 ), respectively. In evaluating 071 ), a\, = cr 2 r (r) for t' E T T 
is given as the unique solution to the fixed-point equation 

4(r) = N + pPeKir), t). (38) 



If 071 ) has multiple solutions, one should choose the solution minimizing the following quantity 

m.k^h^y^Jl^^+NDiNoltf). (39) 

The derivation of Proposition Q] is deferred to the end of this section. The fixed-point equation d38l was originally 
derived in [34| by using rigorous random matrix theory, while we have used the replica method. The second term 
of the right-hand side of fl37l i corresponds to MAI from the users who do not belong to the users in K.. This 
expression implies that the asymptotic MAI becomes the sum of interference from (K— \IC\)M independent SIMO 
channels (l27l >. Furthermore, each interference is represented by two effects: The first term within the curly brackets 
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in d37b corresponds to contribution from channel estimation errors, and the second term corresponds to MAI from 
the single-user SIMO channel with perfect CSI at the receiver, 

* Tt 

Z t ,k,m = hk,m b t,k,m + «i,fe, m , (40) 
with v tkm ~ CAf(0, [{P/M)£ 2 + a 2 ]I N ). The received vector ^ m conditioned on X_j- ^ m and bt^k,m is 
statistically equivalent to y t m in d27l i under the same conditions, due to \bt,k,m\ 2 — P/M with probability one. 
We remark that this interpretation holds only for phase-shift keying modulations. 

The fixed-point equation (|37| | can have multiple solutions. The criterion (|39l l to select the correct solution 
corresponds to the conditional mutual information L I{Bt,%]yt\St,lT T ) m me large-system limit, although 
its proof is omitted. This phenomenon is related to the so-called phase coexistence in statistical mechanics. For 
the details in the context of wireless communications, see ifTSl , ||20l . The existence of multiple solutions implies 
that the asymptotic performance discontinuously changes at a critical point. This asymptotic result predicts that the 
performance sharply changes in the neighborhood of the critical point for finite-sized systems. 

Proposition[T]implies that the equivalent channel between the users in JC and their decoders is not fully decoupled 
into single-user channels, since the channel estimator in Definition Q] utilizes the information Ij- r depending on 
all users. This is the main difference between the cases of perfect CSI and of no CSI. The following proposition 
indicates that the equivalent channel is decomposed in terms of the spectral efficiency. 

Proposition 2. Assume that Assumption [7] and the RS assumption hold. Then, the spectral efficiency ( 1271 ) of the 
optimum separated receiver converges to the spectral efficiency of the optimum separated receiver for the single-user 
SIMO channel K17\ in the large-system limit, 

Km C Bep =j3M(l-^-)c T+1 (aUr),cr 2 c ), (41) 

K,L— voo \ ^cj 

where C_ t+1 (<t 2 x (t), a 2 ) is given by ( |54| ). In evaluating the right-hand side, cr^ r (r) is given as the solution to the 
fixed-point equation A38h On the other hand, a 2 is given as a solution to the fixed-point equation A37h If ( 1571 ) has 
multiple solutions, the solution minimizing ( 1391 ) should be chosen. 

Proof of Proposition^ We use a technical lemma, presented in the end of this section, to prove Proposition [2] 
See Appendix [F] for the details. ■ 
This result implies that the asymptotic equivalent channel between user k and the associated decoder looks like 
the SIMO channel (f27T > in terms of the spectral efficiency. In other words, the performance loss caused by coding 
the data streams for each user separately vanishes in the large-system limit, as shown in ITTI for the perfect-CSI 
case. We remark that it is relatively easy to evaluate (RTt numerically, by using y f fc m ~ z t k m in d40b conditioned 

on Zr T ,k,m and b t,k, m - 

The prefactor (1 — t/T c ) corresponds to the rate loss due to the transmission of pilot symbols. The spectral 
efficiency C T+1 (of r (T), a 2 ) grows with the increase of r since the channel estimation improves, while the prefactor 
decreases. Thus, there is the optimal length of the training phase to maximize the spectral efficiency of the optimum 
separated receiver, as shown in Section [VTl 
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We shall present a sketch of the derivation of Proposition Q] The derivation of Proposition Q] consists of two 
parts: the analysis of the channel estimator and the analysis of the optimal detector. The goal in the analysis of the 
channel estimator is to prove the following lemma. 

Lemma 1. Let 1-L k = {h^jfl, : for all m} be replicas of the channel vectors TLk for a G {1, . . . , n}, with a 
natural number n: {"Hi } are independently drawn from p{Wk) for all k. Suppose that {Ak\ are mutually disjoint 
subsets of {2,3, ... ,n} for all k G /C. We define a random variable X k (T-L^ , 1% ; 0) G K with a set of fixed 
parameters as 

X k {H\}\l T ^)= f f k (H { k 1} ,H£»;e) [] {p(nl a} \l T ,)dn { k a} }, (42) 

J aeA k 

with = ■ a € A k } denoting the set of the replicated channel vectors associated with indices A k . In 

\42\ . fk('H^ 1 \'H' k ^ k ; 0) € K is a deterministic function ofhtj. 1 ^, l~L' k k t and 0. Suppose that the joint moment 
generating function of {-XfcCHi \^T T ;©) : k G /C} exists in the neighborhood of the origin. Then, 

lim p({X k : k e !C}) = T[p(X k =X k ), (43) 

in which 

X k (Hi 1} ,l rT y,e) = j / fe (Hi 1} ,^;0) J] fl {P(hk, m = h { k %\l T ^ kM )dh{ a l} , (44) 

aeA k m=l 

with Xj- t k = k m ■ f or a M m }- I n ( BP . X k and X_ k are abbreviations of (142\) and (144\) . In evaluating i44t , 
o~1 = of r (r) for t G T T is given by the solution to the fixed-point equation h38$ . 

Proof of Lemma [7} See Appendix [B] ■ 
Lemma [T] is used to prove that MAI to the users in JC is self-averaging with respect to lj- r under Assumption Q] 
The natural number n corresponds to the number of replicas introduced in the analysis of the optimal detector. The 
details of the derivation of Proposition Q] are summarized in Appendix [D] 

C. One-Shot CE-MUDD 

We present an analytical expression for the spectral efficiency d20l ) of the one-shot CE-MUDD. The expression 
is straightforwardly obtained from Proposition [2] 

Proposition 3. Suppose that Assumption Q] and the RS assumption hold. Then, the spectral efficiency ( |20| ) of the 
one-shot CE-MUDD is given by 



K.L^oo 

fce/c 



M 



T 



^Hm^ C7 „c = @M I 1 — — 1 J C T+1 {ai{r), ^ (45) 

in the large-system limit, in which C_ T+1 (a^ r (T), a\ (k)) is defined as A34h In evaluating the integrand, a\ = oj r (j) 
for t £ T T is given as the solution to the fixed-point equation rti<S| ). On the other hand, ct^(k) satisfies the fixed-point 
equation 



{P/M)e + ^(n) N \ {P/M)e + <x c 2 (k) 



2 



E 



l^l,ll| 2 |^r+l,l,l - (br+1,1,1, 



2 



(46) 
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with hi i — h.i\ and £ 2 = £ 2 ((7 2 r (T), r) given by ( 1371 ) ana! ( 1321 ), respectively. If the fixed-point equation M6\ has 
multiple solutions, one should choose the solution minimizing the following quantity 



f3M 



*I(h 1A ;y ^+1,1,1,1^,1,1) + i 1 ~ «)^r+i,i,i,^i,i;y T+1 ! JIr.,1,1) + ^W>||o*(k))- (47) 



The integrand in (|45]> is equal to the asymptotic spectral efficiency of the optimum separated receiver for a 
MIMO DS-CDMA system in which (1 — k)K users send data symbols and kK users transmit symbols known to 
the receiver in symbol period r + 1. The symbols known to the receiver corresponds to the data symbols decoded 
successfully in the preceding substages. The factor 1 — n appears in the last term of the right-hand side of d46l >. 
since the system load decreases effectively as successive decoding proceeds. 

Proof of Proposition [3} The quantity C T +i.k in the spectral efficiency (l20b of the one-shot CE-MUDD 
corresponds to the spectral efficiency of the optimum separated receiver for a MIMO DS-CDMA system in which 
the first (k — 1) users transmit known symbols {pt.k',m} for k' = 1, . . . , k — 1 in symbol period t(> r). We have 
already evaluated the spectral efficiency in Proposition [2] when all users transmit data symbols in symbol period t. 
We do not use the statistical property of each data symbol b t .k. m in the derivation of Proposition [2] Furthermore, the 
derivation still holds even if the noise variance of for the single-user SIMO channel d27l ) depends on k. For some 
natural number K , consider the large-system limit in which K, L, and K tend to infinity while j3 = K/L and 
Ko = Ko/K are kept constant. Replacing the prior of b t ^'.m for k' = 1, . . . , k — 1 by Prob(6t.fc' jm = Pt,k',m) = 1 
and Prob(&t,fc', m 7^ Pt,k',m) — in substage k > Kq, we find that the fixed-point equation d37b reduces to d46b 
with k = k/K, since E[||hJ T 1 || 2 |6 t , fe ',i - (b t ,k',i)\ 2 ] = for k' = 1, . . . , k - 1. Similarly, the quantity ([39]) for 
t = t + 1 reduces to d47| i. These results imply 

r 



C onc =0 [I T 



1 Kn ~ l M K 



K ^ K 

fe=l k=K, 



(48) 



in the large-system limit. In evaluating the second term, <7 2 r (r) is given as the solution to the fixed-point equa- 
tion (|38l . On the other hand, ct 2 (k) satisfies the fixed-point equation d46| i. If the latter fixed-point equation has 
multiple solutions, the solution minimizing ( f4Tb should be chosen. 

The definition of the Riemann integral implies that K~ x Y^=K * n tne s 600110 ! term °f < l48b becomes J K dn in 
the large-system limit. Therefore, we arrive at Proposition [3] by taking kq — > 0, since the first term in ( |48| i vanishes 
in that limit. ■ 

D. Joint CE-MUDD 

We evaluate the spectral efficiency (fT&t of the joint CE-MUDD in the large-system limit. An analytical expression 
of the spectral efficiency ( TToT ) is immediately obtained from Proposition [3] 

Proposition 4. Suppose that Assumption Q] and the RS assumption hold. Then, the spectral efficiency ( 1761 ) of the 
joint CE-MUDD is given by 

lim C joint = ^ £ f 1 C t (al(t-l),a 2 c (K))dK, (49) 

K,L— ^00 l r L — ' / n 
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in the large-system limit, in which C_ t {a^ r {t — 1), <?;?(«)) is defined as In evaluating the integrand, of r (t — 1) 
is given as the solution to the fixed point equation $38} with r = t — 1. On the other hand, ct^(k) satisfies the 
fixed-point equation \46\ with h\\= h\\ and £ 2 = £ 2 (of r (t — 1), t — 1) given fey ( 157 1 ) one/ ( 1521 ), respectively. If 
the fixed-point equation \46\ has multiple solutions, the solution minimizing \47\ should be chosen. 

Derivation of Proposition^ The quantity C t ,k m ( H"6l > corresponds to the spectral efficiency of the one-shot 
CE-MUDD for the MIMO DS-CDMA channel © with the first t — 1 symbol periods as a training phase. Applying 
Proposition [3] with r = i — 1 to ( fl6b , we obtain (|49l >. ■ 
The spectral efficiency ( |49t does not decrease with each increase of r. Furthermore, the integral in d49l > is equal 
to zero for t = 1, since i = 1 implies no pilots. Therefore, the spectral efficiency d49b is maximized at r = and 
t = 1 if the integral in $4% is strictly positive for t > 2. This observation implies that the joint CE-MUDD can 
reduce the training overhead significantly. 

Remark 3. We have so far considered the equal power case. One interesting issue would be temporal power 
allocation. It is straightforward to extend Proposition^\to the temporally unequal power case. Intuitively, allocating 
much power to around the beginning of one fading block improves the accuracy of the channel estimates, while 
power used for transmission of data symbols around the end decreases. Thus, it is not straightforward to find the 
temporally optimal power allocation. This power allocation issue is left as future work. 

Remark 4. It is possible in principle to evaluate the spectral efficiency of joint CE-MUDD based on nonlinear 
channel estimation in the large-system limit, in which the channel estimator sends the marginal posterior pdfs 
{p(Hk\3ht-ii3-c t+1 )} to the optimal detector in stage t, instead of (I77D. The spectral efficiency would be given as 
that of a receiver with the optimal nonlinear channel estimator for the single-user SIMO channel ( |27| ). However, 
the obtained spectral efficiency is difficult to calculate in terms of the computational complexity. In this sense, the 
bound based on nonlinear channel estimation is not an analytical one, while the formula itself is simple. 

E. LMMSE Receiver 

The LMMSE receiver for M = N = 1 has been analyzed rigorously in ll34l by using random matrix theory. In 
this section, we show that the replica method can derive the same result as the rigorous one. 

The spectral efficiency d25l ) of the LMMSE receiver is given via the spectral efficiency of a bank of the single-user 
SIMO channels (|27]) with an LMMSE receiver. We first define the LMMSE receiver for the SIMO channel d27b . 
after which we present an analytical expression for the spectral efficiency d25l > of the LMMSE receiver in the 
large-system limit. The LMMSE receiver is obtained by replacing the optimal detector in Fig. [5] by the LMMSE 
detector. 

Definition 3. The LMMSE detector estimates the data symbol bt,k.m from the received vector y for the single- 
user SIMO channel \27\ in the same symbol period t and the information about hk, m provided by the LMMSE 
channel estimator defined in Definition [2] and feeds the information about 6t,fc,m fo the per-stream decoders. In 
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order to define the information about bt,k,m> we consider the single-user SIMO channel in the tth symbol period 
postulated by the LMMSE detector, 

f .(L) ;,( L ) +1 7,( L ) +~( L ) 

with f^l ~ CjV(0, cr| Jjy). In ( 1501 ), /i fc T m denotes the LMMSE channel estimate ( 1371 ) for the single-user SIMO 
channel \27\ . The vector W^f. m € C w conditioned on Ij- k m is a CSCG random vector with the covariance 
matrix {P/M )S 4 fe m , in which S t fc m denotes the covariance matrix of the channel estimation errors Ah[ T m — 
hk,m — \kkrn' i- e -> l±t k m = ^[^tk^m (^tkk^n ) H \^-T T k m\- The information about bt,k,m is defined as 

P^kJy^Ln = y t ,k,m^r T ,k,m) = J P^Ln^ulm = Mi,fc,m ' —T T ,k,m)^^tJc,m^ (51) 

with 



/7(L) - (L) I (L) T \ ^(yt,fc,ml^t,fe,m'^t,fe,-m'^fe,m)^(^t,fe,m)^(^t,fe,m) ,™ 



;m\». t! k, m '-'T,k,mJ (L) ,r(L) _ (L) f - ^ v , - (L) v ,7(1.) ,7(L) 

where p(yfk m \^tk m'—tk -m'^fem) denotes the single-user SIMO channel ( 1501 ) postulated by the LMMSE detector. 

Proposition 5. Under the RS assumption, the spectral efficiency ( 1251 ) of the LMMSE receiver converges to the 
spectral efficiency of the LMMSE receiver for the single-user SIMO channel H27\l with in the large-system limit: 



lim C h = pM 1-- )C T+1 (at T (r),ai), (53) 

where C_ T+1 (a 2 r (T),a^) is given by H34i . In evaluating ( 1531 ), o^ t {t) f or t & T T is given as the solution to the 
fixed-point equation (\38h On the other hand, cr£ satisfies the fixed-point equation 



N 



', % I|2|l /r(L) \_ ,2 



\hk,m\\ \Kk,m - (b t ,k, m }^ 



(P/M)Z 2 +al N \(P/M)£ 2 +o 
with e = e^Ur\ r) and{...) L =J. ■ ■ p(6$, m ,«$,J»S m = £ fcm ,Zr T A J^M^It 



(54) 



Note that the LMMSE receiver can achieve the constrained capacity of the single-user SIMO channel (f2Tb . The 
fixed-point equation d54l l coincides with that in |34| for M — N = 1. This implies that the result obtained by 
using the replica method is correct for the LMMSE receiver. The difference between the fixed-point equations ( f37b 
and ( f54b appears in the last terms of their respective right-hand sides. The last term of the right-hand side of ( 1541 ) 
corresponds to the mean-squared error (MSE) of the linear MMSE estimate of bt,k, m f° r the single-user SIMO 
channel ( f40b with perfect CSI at the receiver, whereas the last term of the right-hand side of ( f37T > corresponds to 
the MSE of the minimum mean-squared error (MMSE) estimate of b t ^,m for the same SIMO channel. 

VI. Numerical Results 

The spectral efficiency of the joint CE-MUDD is compared to the spectral efficiencies of the one-shot CE-MUDD, 
the optimum separated receiver, and the LMMSE receiver, on the basis of Propositions |2]-[5] See Table Q] for the 
differences between the four receivers. The performance gap between the joint CE-MUDD and the one-shot CE- 
MUDD corresponds to the gains obtained by using the decoded data symbols to refine the channel estimates. The 
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Fig. 6. Symbol error probability of 6i,t,i versus P/No for the LMMSE receiver. From top to bottom, {+} denote the symbol error 
probabilities of the LMMSE receiver for (K, L) = (8, 4), (8, 8), (8, 16). {□} represent the symbol error probabilities of the LMMSE receiver 
for (K, L) = (16, 8), (16, 16), (16, 32) from top to bottom. The solid lines represent the symbol error probabilities of the LMMSE receiver 
in the large-system limit for /3 = 2.0, 1.0, 0.5 from top to bottom, r = 4 and M = N = 1. 

gap between the one-shot CE-MUDD and the optimum separated receiver is related to the gains obtained by using 
the decoded data symbols to mitigate MAI. The performance gap between the optimum separated receiver and 
the LMMSE receiver corresponds to the gains obtained by performing the optimal MUD, instead of the LMMSE 
MUD. The spectral efficiencies of the three receivers based on the one-shot channel estimation are well-defined for 
r > 0. For clarity, these spectral efficiencies are calculated for r > 0, while the spectral efficiency d49i l of the joint 
CE-MUDD is evaluated for r = 0, 1, . . . ,T C . In all numerical results, unbiased QPSK input symbols are used. 

We first present Monte Carlo simulation results for finite-sized systems and compare them with the analytical 
predictions derived in the large-system limit. In the Monte Carlo simulations we assume QPSK spreading, which 
is performed by using two mutually independent binary-antipodal random spreading sequences for in-phase and 
quadrature-phase channels. We only consider the LMMSE receiver since the computational complexity of the 
optimal detector is high. Figure [6] plots the symbol error probability of &i.t,i for M = N = 1 and r = 4. The 
solid lines represent the symbol error probability of the LMMSE receiver for the single-user SIMO channel (|27| > 
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Fig. 7. Spectral efficiency versus r/T c in the large-system limit. The pluses connected by straight lines display the spectral efficiency of 
the joint CE-MUDD. The dashed lines, the solid lines, and the dotted lines represent the spectral efficiencies of the one-shot CE-MUDD, the 
optimum separated receiver, and the LMMSE receiver, respectively. The fine dotted straight lines shows an upper bound of the constrained 
capacity (3) based on the optimal receiver with perfect CSI. P/Nq = 6 dB, M = N = 1, and T c = 20. 



in which a\, for t' € T r is given by the solution to the fixed-point equation d38l > and in which erf is given as the 
solution to the fixed-point equation d54"l >. We find that the analytical predictions are in agreement with the Monte 
Carlo simulation results for K = 16, while they are slightly different from the simulation results for K = 8. 
This result has two consequences: One is that the asymptotic results for the LMMSE receiver are applicable to a 
non-Gaussian distribution of si t t,k,m> as noted in [34]. The other is that the convergence of spectral efficiency to 
its asymptotic value is so fast that our analytical results, especially for the LMMSE receiver, provide reasonably 
good approximations of the true spectral efficiencies even for small-sized systems. The latter observation has also 
been made in 03), ll60l . 

We next focus on the asymptotic spectral efficiencies for single-antenna systems. Figure [7] displays the spectral 
efficiencies of the four receivers for P/N = 6 dB. An upper bound (1 — t /T c )C^ t ^ of the constrained capacity (f5]) 
is also shown, which is based on the asymptotic spectral efficiency C^p"' of the optimal receiver with perfect CSI 
ifTTl . We find that the spectral efficiencies of the one-shot CE-MUDD and the two separated receivers are maximized 
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Fig. 8. Spectral efficiency versus t/T c in the large-system limit. The pluses connected by straight lines display the spectral efficiency of 
the joint CE-MUDD. The dashed lines, the solid lines, and the dotted lines represent the spectral efficiencies of the one-shot CE-MUDD, the 
optimum separated receiver, and the LMMSE receiver, respectively. The fine dotted straight lines shows an upper bound of the constrained 
capacity (3} based on the optimal receiver with perfect CSI. The widely-spaced dotted lines denotes solutions for the optimum separated receivers 
which are not selected by the criterion = 2.75, P/N Q = 15 dB, M = N = 1, and T c = 20. 



at optimal r = r opt . This observation results from two effects: the improvement of the accuracy of the channel 
estimation and the decrease of the number of transmitted data symbols, both caused by the increase of r. The 
spectral efficiency (|49| | of the joint CE-MUDD is maximized at r = and r = 1. These results imply that the 
training overhead can be significantly reduced by using joint CE-MUDD. 

The performance gaps between the one-shot CE-MUDD and the two separated receivers are negligibly small for 
(3 = 0.5, while there is a noticeable gap between the spectral efficiencies of the joint CE-MUDD and the one-shot 
CE-MUDD. We found by numerical evaluation that P/N ps 8.2 dB is required for /3 — 0.5 in order for the 
one-shot CE-MUDD to achieve the same spectral efficiency as that of the joint CE-MUDD for P/Nq — 6 dB. 
In other words, the joint CE-MUDD provides a performance gain of 2.2 dB. For (3 = 1.5, the performance gaps 
between the optimal one-shot CE-MUDD, the optimum separated receiver, and the LMMSE receiver are large. 
This result implies that performance gains can be obtained by using an MUDD scheme with higher performance 
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than the LMMSE receiver. Furthermore, there is a large performance gap between the joint CE-MUDD and the 
one-shot CE-MUDD. More precisely, we found that P/Nq « 8.5 dB is needed for (3 — 1.5 in order for the one- 
shot CE-MUDD to achieve the same spectral efficiency as that of the joint CE-MUDD for P/Nq = 6 dB. These 
observations indicate that joint CE-MUDD can provide significant performance gains regardless of j3, compared to 
one-shot CE-MUDD. 

Figure [8] shows the spectral efficiencies of the four receivers for P/Nq = 15 dB and j3 = 2.75. The upper bound 
(1 — r/T c )C„pt is also shown. One interesting observation is that the spectral efficiency of the optimum separated 
receiver is discontinuous. This result predicts that the spectral efficiency of the optimum separated receiver exhibits 
a waterfall behavior: The spectral efficiency rapidly degrades in the neighborhood of the discontinuous point r = r c , 
shown by the arrow, which corresponds to the threshold between interference-limited and non-limited regions. The 
system is not interference-limited for r > r c , i.e., the asymptotic multiuser efficiency No/a^ is close to one. On 
the other hand, the system is interference-limited for r < r c , i.e., the asymptotic multiuser efficiency is small. 

The spectral efficiency of the one-shot CE-MUDD seems to be continuous. This observation is explained as 
follows: Users decoded in the initial substages of successive decoding are interference-limited, while the remaining 
users are not interference-limited. Therefore, the achievable rate of each user changes discontinuously for the one- 
shot CE-MUDD. However, the achievable sum rate of the one-shot CE-MUDD changes continuously since the 
threshold k c between the two groups of interference-limited users and non-limited users should move continuously 
with the change of r. We remark that it might be possible to cancel out MAI successfully by optimizing the power 
allocation and the rate of each user, as discussed in |26|. 

Finally, we investigate multiple-antenna systems. Figure [9] shows the spectral efficiencies of the four receivers 
for M = N — 8. The upper bound (1 — r /T c )C^pl is also shown. We find that the optimal number r opt of 
pilot symbols is larger than that for M = N = 1 (Compare Figs. [7] and |9), since the number of unknown channel 
coefficients increases. On the other hand, the spectral efficiency of the joint CE-MUDD is maximized at r = and 
t = 1 even for M = N = 8. Consequently, the performance gap between the joint CE-MUDD and the one-shot 
CE-MUDD becomes larger than that for single-antenna systems. When j3 = 0.5, the performance gap between the 
joint CE-MUDD and the one-shot CE-MUDD is approximately 1.47 bits/chip for M = N = 8 (0.18 bits/chip per 
the number of antennas), while the performance gap is approximately 0.10 bits/chip for M = N = 1. We found 
by numerical evaluation that a performance gain of 1.47 bits/chip corresponds to that of approximately 3.4 dB. For 
(3 = 1.5, the performance gain increases up to approximately 2.88 bits/chip for M = N = 8 (0.36 bits/chip per the 
number of antennas), while it is 0.35 bits/chip for M = N = 1. Interestingly, a performance gain of 2.88 bits/chip 
corresponds to a gain of 7.2 dB. These results imply that joint CE-MUDD can provide a significant performance 
gain for multiple-antenna systems. 

Figure [lO] shows the optimal training overhead r opt /T c for M = N, maximizing the spectral efficiencies of the 
one-shot CE-MUDD and the two separated receivers. Note that the optimal training overhead for the joint CE- 
MUDD corresponds to 1/T C = 0.05. We find that the optimal training overheads increase logarithmically with the 
increase of the number of antennas for small M, while they tend towards around 0.5 for large M. This observation 
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Fig. 9. Spectral efficiency versus t/T c in the large-system limit. The pluses connected by straight lines display the spectral efficiency of 
the joint CE-MUDD. The dashed lines, the solid lines, and the dotted lines represent the spectral efficiencies of the one-shot CE-MUDD, the 
optimum separated receiver, and the LMMSE receiver, respectively. The fine dotted straight lines shows an upper bound of the constrained 
capacity {3) based on the optimal receiver with perfect CSI. P/Nq = 6 dB, M = N = 8, and T c = 20. 



implies that the performance gains obtained by using joint CE-MUDD increase as the number of antennas grows. 

VII. Conclusions 

We have analyzed the asymptotic performance of joint CE-MUDD for randomly-spread MIMO DS-CDMA 
systems with no CSI. The main contribution of this paper from a theoretical point of view is to derive a lower 
bound for the spectral efficiency of the optimal joint CE-MUDD on the basis of successive decoding with suboptimal 
LMMSE channel estimation, along with the spectral efficiencies of the one-shot CE-MUDD and the optimum 
separated receiver. The asymptotic performance of MIMO DS-CDMA systems with no CSI is characterized via 
the performance of a bank of single-user SIMO channels with no CSI. This decoupling result is an extension of 
previous studies ifTTl . Il20l for the case of perfect CSI at the receiver to the no-CSI case. 

The main contribution of this paper from a practical point of view is to demonstrate that joint CE-MUDD can 
significantly reduce the training overhead due to transmission of pilot signals, compared to receivers based on 
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Fig. 10. T pt/T c versus M(= N) in the large-system limit. The dashed lines, the solid lines, and the dotted lines represent the optimal 
training overhead of the one-shot CE-MUDD, the optimum separated receiver, and the LMMSE receiver, respectively. P/No = 6 dB and 
T c = 20. 



one-shot channel estimation. The results imply that joint CE-MUDD can provide significant performance gains for 
systems which require large training overhead for attaining accurate CSI. We conclude that the iterative refinement 
of channel estimates by utilizing soft feedback from per-user decoders may lead to substantial reduction of the rate 
loss due to transmission of pilot signals for MIMO DS-CDMA systems. 

Appendix A 
Derivations of LMMSE Estimators 

A. Perfect-CSI Case 

We shall present two derivations of the LMMSE estimator for vector channels with perfect CSI. One is based 
on the minimization of the MSE, and the other on Bayesian inference. Let us consider the N x K vector channel 
with perfect CSI 

y = Hx + n e C N . (55) 
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In d55l l, x G C R denotes a zero-mean transmitted vector with covariance matrix E[cc:r i? ] = PI k- The noise vector 
n G C N is a zero-mean random vector with covariance matrix S. Furthermore, H G i^nxk denotes a random 
channel matrix with mean H G C NxK . The three random variables H, x, and n are independent of each other. 

The goal is to find the LMMSE estimator, i.e., a linear estimator x = F pcr y of the transmitted vector that 
minimizes the MSE E[||a: - &|| 2 |H]. Substituting x = F pcr y into the MSE E[||a; - &|| 2 |ff] yields 

E[||aj - xf\H] = P\\I K - F^ CI H\\ 2 + Tr (jf er £F per ) . (56) 

It is well known that F pcl to minimize the MSE d56b is given by 

F per = S- 1 Jf(p- 1 J ic + H"^ 1 ^- 1 . (57) 

In the derivation of d57i i. we have minimized the MSE (l56l l among all possible linear estimators x — F pcr y. 
The MSE d56l l for the LMMSE estimator is larger than the MMSE in general, which is the minimum of the MSE 
for all possible estimators. However, it is known that the two MSEs coincides with each other when the transmitted 
vector x and the noise vector n are CSCG. This observation implies that the LMMSE estimator can be derived 
as the posterior mean estimator for a Gaussian vector channel. Let us define the vector channel postulated by the 
receiver as 

2/per = Hx pcl + n pcr , (58) 
with x pcl ~ CAf(0, PI k) and n pcr ~ CAf(0, X). It is straightforward to confirm that the posterior mean estimator 
^[^per|y per = V ■• H] f° r tne postulated vector channel is equal to the LMMSE estimator. 

B. No-CSI Case 

We consider the no-CSI case: The channel matrix H is assumed to be unknown to the receiver, while the 
statistical properties of x, H, and n are known. The LMMSE estimator is defined as the linear estimator x = F H y 
to minimize the MSE E[||cc — x\\ 2 ] averaged over the channel matrix H. Substituting x = F H y into this MSE 
gives 

E[\\x-x\\ 2 } = P\\I K - F H H\\ 2 + Tr^F H (PE[AHAH H }+?^ f\ , (59) 
with AH = H — H. Comparing this expression with (l56l l. we find that the optimal filter F is given by 

F = V-IH{P- 1 I K + H H H-^H)- 1 , (60) 

with S Bf = PE[AHAH H ] + S. 

In order to interpret the meaning of the effective covariance matrix S c f, we re-write the vector channel 05] ) as 

y = Hx + AHx + n. (61) 

The effective covariance matrix S c f is equal to the covariance matrix of the channel estimator error plus the noise 
vector, i.e., AHx + n. This observation implies that the LMMSE estimator is equal to the posterior mean estimator 
E[jc|j/ = y] for the postulated vector channel, 

y = Hx + w + h, (62) 
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where x £ C K , w € C N , and n £ are independent CSCG random vectors with covariance matrices PIk, 
PK[AH AH H ], and E, respectively. If H = O, the LMMSE estimator is obviously independent of the received 
vector y. 

Appendix B 
Proof of LemmaQ] 

In order to prove Lemma [TJ we need the following lemma, which is derived by extending the replica analysis in 



Lemma 2. Suppose that {Ak} are mutually disjoint subsets of {2, 3, . . . , n} for k € JC. Then, 



lim Ex T 

K,L-+oo 



keK aeA k 



A I 



keK. m=l 



ae{l}uA k 



(63) 



with % 



{a} 
K 



{%£ : fc € JC} denoting the ath replicas of the channel vectors for the users in JC. In evaluating the 



right-hand side of ( 1631 ), af = o~( t {t) f or t £ T T is given by the solution to the fixed-point equation 

Derivation of Lemma |2} See Appendix [C] ■ 
Lemma [2] implies that the information about {hk, m } obtained by utilizing the information If r is mutually 
independent in the large-system limit for all k £ JC and m, and that the information about each hk,m looks like that 
provided by the LMMSE channel estimator for the single-user SIMO channel ((27). A result equivalent to Lemma|2] 
was proved in [34] for M — N = 1, by using random matrix theory. Lemma|2]is used to evaluate the joint moment 
sequence of {Xk : k £ JC}, defined by d42l . in the following proof of Lemma [TJ 

Proof of Lemma [7} It is sufficient to prove that the joint moment sequence of {Xk : k £ JC} converges to 
that for the distribution defined by the right-hand side of d43l in the large-system limit, since we have assumed the 
existence of the joint moment generating function. 

For non-negative integers {fik}, a joint moment of {Xk ■ k £ JC} is given by 



E 



keK 



= E 



iniw^i 9 ) n faw^)^} 

keK i=i J aeA M 



(64) 



where {Au : for all k £ JC, i} are mutually disjoint subsets of {2, 3, ... , n} satisfying \Au \ = \Ak\- The right- 

- Ill A 

hand side of d64b is defined as the integration of Jl/ceK; 11™= l /*C^k i^k * ! ®) w i m res P ect to the measure 



E 



^7V 



pw£ } \i Tr )ji n p(u { k a} \i TT ) 

fee'C aeAk 



tup n n dn i a} > 

fce'C aeA k 



(65) 



with Ak = U™^ 1 ^4^' ) . Applying Lemma [2] to d65l l. we obtain 



lim E 

K,L-yoo 



keK 



keK 



flk r (*) M 

n / Mn { k i} X k ;©) n n {f(^=^:ii^, fc , ro )^s} 

1=1 m = 1 aeA[: ) 



(66) 
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in the large-system limit, which is equal to the corresponding joint moment H feeJC E[X **] f° r tne distribution 
defined as the right-hand side of ( |43T >. ■ 

Appendix C 
Derivation of Lemma[2] 

A. Replica Method 

We present a brief introduction of the replica method. For details of the replica method, see Il37l - ll39l . Let 
f(X,Y;N) > denote a deterministic function of two random variables X and Y with a parameter N, Our goal 
is to evaluate the expectation E Y [Z(Y; N)' 1 ] for the so-called partition function Z(Y; N) = Ex[f(X,Y; N)} in 
the limit N — > oo. For that purpose, we first evaluate limjv-voo Ey [Z(Y; A)™ -1 ] for any natural number h € N, 
by utilizing the following expression 



l Y [ZiY-^Nf- 1 ] = E 



Y[f(X a ,Y;N) 



a=l 



(67) 



where {X a : a = 1, . . . , n — 1} are i.i.d. replicated random variables following p(X). Suppose that the analytical 
expression obtained via ( f67b is well-defined even for iiel. We take h — > +0 to obtain an analytical expression of 
lim J /v_ i . 00 Ey [Z(Y; A) -1 ], assuming that the obtained expression coincides with the correct one. It is a challenging 
problem to prove whether this assumption holds. 



B. Formulation 

In order to show Lemma [2] by using the replica method, we transform the left-hand side of (l63l l into a formula 
corresponding to Ey [Z(Y; A)™ -1 ]. The posterior pdf of the replicated channel vectors for the users in JC, 

defined in the same manner as in (fTTT i. is given by 

/ U.U pQ>t \n {a} , St , u t ) p (hm )du\ a ^ 



p(n { K a} \iT T ) 



(68) 



with H {a} = {H { k a} : for all k} and = {U { k a} : for all k £ JC}. In ([68]>, the pdf p{y t \H {a} ,S t Mt) is an 

abbreviation of p{yt\H = H^ a \St,Ut). Let %■£ = {H k 7 k : k G JC} denote the replicas of the channel vectors 
for A = UkeicAk, with H h Ak = {u{ a} : a G Ak}- Applying (|68]> to the left-hand side of d63]) and subsequently 



introducing a real number n € K, we obtain 



Urn 



K.L- 



k£K aeA k 



lim lim S fi (^ 1} ,^^), 



(69) 



where H fi (?{£- 1 ',%;£) i s given by 



2fi(^r , "H^) — E 




fi-l-l^l 



a£{0,l}Uyt U=i J t=i 



-t/{!} -1/.A 



(70) 
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In ( TTOt , we have written H as %^ . Furthermore, the set denotes all replicas of the channel vectors for a > 2 
except for Hg. The expression d70t implies that Yit=i pO^I^i StMt) corresponds to the function f(X, Y; N) with 

x = n. 



C. Average over Quenched Randomness 

We evaluate <T70b up to 0(1) in the large-system limit for any natural number n, satisfying A C {1,2, ... ,n}. 
We first calculate the expectations in (TTOb with respect to {3^t} and {St}- For n £ N, we have a special expression 
of© 



S fi (^ 1} ,^)=E 



II X\p{yt\H {a \s u u t ) P {u^) \J{dy t du^du { ^d\ni 

a=0 [t=l J t=l 

Let us re-write the MIMO DS-CDMA channel (O in the training phase as 

K t M 



-T/i 1 } <ijA 



v ^ fc=l t=l m=l 



(71) 



(72) 



with y ; = (y^i, • ■ • , yf T ) T ^ C Nt and rt/ = (n^, . . . , nf T ) T g C JVt . By evaluating the expectation in ( TTTb with 
respect to {St}, from the independency of {si t t,k,m} f° r all ^ <ED yields 



S s (4 1} ,^)=p(4 1} )p(^)E 



E 



n 



-i/i 1 } <ijA 



where the inner expectation is taken over {s\,t,k,m '■ f° r a U U K m}. In (|73"T >. 6 C Wt is defined as 

K T M 



V 11 r> i + i ™ i 



,{«} 



fe=l t=l m=l 



where e 



is given by 



, (73) 



(74) 



(75) 



with h { k °l = h k<m . Let us define v E C^ +1 ^ Nt as v = ((v {0} ) T , . . . , {v^) T ) T . We perform the Gaussian 
integration in d73l with respect to j/j to obtain 



E 



-u H A T (n)u 



(7rAf ) flArT (l +n) 



with 



A T (n) 



N Q + nN 



I Nt . 



(76) 



(77) 



-Is (1 + n)Jn - l ft 1? 

The CSCG assumption of {si t t,k,m} implies that v in (l76l l conditioned on {"H^ : for all a} and A" is a CSCG 
random vector with the covariance matrix 



K t M 



(78) 



fe=l t=l m=l 
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with u)k,t,m = (( u 'l /m) T ' ■ • ■ ' ( u; I't^m) T ) T - Let us assume that Q is non-singular. Then, d76l ) yields 



,LG T (Q;h) 



-t/{!} njA 



with 



G r (Q;n) 



lndct(J + A T (h)Q) - nNr ln(7rA ) - A/Yln(l + n) 



(79) 



(80) 



Note that d79l holds for any K and L. Without the assumption of sij.k.m ~ CAf(0, 1), it would be necessary to 
evaluate lnE[cxp(— « ff A T (n)v)\ up to 0(if _:L ) in the large-system limit by using the Edgeworth expansion, since 
d63l is a quantity of O(l). The vector v conditioned on {Ti.^ : for all a} and X converges in law to a CSCG random 
vector with the covariance matrix d78l ) in the large-system limit if one assumes that {Re[si t t.k.m], Im[si,t,k,m]} 
are i.i.d. zero-mean random variables with variance 1/2 and finite moments for all I, t, k, and m. Therefore, the 
expansion coefficient for 0(1) should coincide with d80b . Furthermore, it was shown that the expansion coefficient 
for 0(A' _1 ) does not affect the result in the large-system limit for conventional DS-CDMA channels (6T). Therefore, 
it is expected that the results in this paper holds for a general distribution of s^t.k.m- 



D. Average over Replicated Randomness 

The conditional expectation in d79l is given as the integration of exp[LG T (Q;h)] with respect to the measure 
Hk(Q', n)dQ on the space ■Mf fi+1 s NT of all (n + 1)Nt x (n + 1)Nt positive definite Hermitian matrices: 



VK(Q;n)dQ, 



(81) 



M; 



where /«K-(Q;n) denotes the pdf of Q conditioned on and H^, induced from d78l ). We shall calculate 

Hk(Q', n) up to 0(1) since our goal is to evaluate (T70b up to O(l) in the large-system limit. 
We first evaluate the the moment generating function of ( |78l l, defined as 



M K {Q-n) =E 



'lyl^} ay. 



(82) 



where an (h + 1)Nt x (n + 1)Nt non-singular Hermitian matrix Q is given by 

^ Qo,o 1^0,1 ' ' ' \Qo,h \ 

2^n— I,- 

\2^0,n ' ' ' %Qh— l,n Qh,n J 

with At x At complex matrices {Q a b : < a < b < h}, in which the (j, fc)-element of Q a b is given as 



Q = 



(83) 



qfc'l ^ anc ^ w i tn ^ r x Hermitian matrices 



1=1,2 



WalT 



1(~Nt-1,Nt\* 
2v"a,a / 



2 y a,o 



1^Wt-1,JVt 
2 "a, a 

~Nt,Nt 
"a, a / 



for all a. 



(84) 
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Since {Xk = {xt r k,m '■ for all t, m}} and {Hi : a — 0, . . . , n} are mutually independent for all k, substituting 
<[V8j into (182) implies that (182) is decomposed into the product of K terms: 



M 



K 



(c;»)= n M fc(Q;")n E [ ( 



Ak(Q;n) 



fce/c 



with 



(85) 



(86) 



T M 



(87) 



M k (Q;h) = E 
In these expressions, Afc(Q;ri) is given by 

A fc (Q;n) = ]T £ Tr( UiAmU ^ ra Q) 

t=l m =l 

It will be shown later that the moment generating function ( f86l > for the users in /C reduces to the right-hand side 
of (63). 

We next calculate the pdf HK{Q',n) up to 0(1). The inversion formula of the moment generating function 
implies 



2ni 



[(h+l)Nry 



l[M k (Q-fi)\e- KI ^ Q ^dQ : 



(88) 



where the integrations with respect to dRe[q^ b ], dlm[q^ b ], and dq^ a are taken along imaginary axes, respectively. 
In (88), Ik(Q, Q; ra) is defined as 

1 



I K (Q, Q\ n) = TY(QQ)--Y, lnE 



a A fc (Q;n) 



(89) 



where Ak{Q;n) is given by (87). Note that the limit Hirk^oo Ik{Q, Q',n) = I(Q,Q;n) exists obviously. 
Applying the saddle-point method to (88) in the large-system limit, we obtain 



2vr / 



[(n+l)Afr] 2 /2 



detV 2 6 /(Q, Q s ;n) 



-1/2 



[l + o{K% (90) 



where Q s is implicitly given as the solution to the fixed-point equation 



T M 



E 



,H pA fc (C;R) 



if->oo if 



fcg/C t=i m=l 



E 



e A fc (S;n) 



(91) 



In 



, V|>/(Q, Q s ;n) denotes the Hesse matrix of I(Q, Q;n) with respect to Q at the saddle point Q s . In 
the derivation of (90), we have used the fact that the Hesse matrix V^/(Q, Q s ; n) is negative definite, due to the 
assumption of the non-singularity of Q. Note that the concavity of (89) with respect to Q at Q = Q s is required 
for the use of the saddle-point method, since the integration in (88) is taken over imaginary axes. 

Finally, we evaluate (ST) up to O(l) in the large-system limit. Substituting (190) into (|8D and subsequently using 
the saddle-point method, we obtain 



Z A (H { K 1} ,H£) = D AP (n { K 1} ) P (H£) n M k (Q s ;h)e- K *^[l + o(K)}, 



(92) 



keic 
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where Q s is a solution to satisfy 



Q s = arginf $(Q;n), (93) 



with 

*(G;n) = /(Q,Q s ;n)-/?~ 1 Gr(Q;n). (94) 

In (g2j), D h = [detV^$(Q s ;n)] _1 / 2 |detV^7(Q s , Q s ;n)| _1/2 , with V 2 2 $(Q s ;n) denoting the Hesse matrix of 
Q(Q', n) with respect to Q at the saddle-point Q s . We have assumed that the Hesse matrix Vq < I ) (Qs; is positive 
definite. Calculating the solution d93l with ( f80b and d89l , we find that Q s satisfies the stationarity condition 

Q = -/3- 1 (I + A T (h)Q)- 1 A T (ti). (95) 

E. Evaluation of Fixed-Point Equations 

It is generally difficult to solve the coupled fixed-point equations d9Tb and d95l ). At this point, we assume that 
the solution (Q B , Q B ) satisfies RS. Note that it depends on models whether the RS assumption holds. Fortunately, 
It is empirically known that the RS assumption is correct when replicated random variables are CSCG. 

Assumption 2 (Replica Symmetry). (Q s , Q s ) is invariant under all permutations of replica indices: 

Qs=[ n , (96) 

Q s =( ®° H ), (97) 

with M. M e C NtxNt , and with Nt x Nt Hermitian matrices Q , Q , Q v Q x , Q, and Q. 

We first solve the fixed-point equation d95l ) under Assumption [2] Evaluating the right-hand side of d95l ) with ( 177b 
and subsequently comparing both sides, we obtain 

Q = -n(£ + n£ )-\ M = (E + nEo) -1 , Q - (S + nS )~ 1 S S~ 1 , Q 1 = Q-T. 1 , (98) 
where So and S are given by 

S = TVo/at. + /3(Q -M-M H + Q), (99) 

S = N I Nt + P(Q 1 -Q). (100) 

Note that S and S + nSo must be invertible since (J + A T (n)Q s ) _1 exists due to the assumption of the positive 
definiteness of Q. Nishimori [57] proved So = S for unfaded CDMA systems. The condition So = S implies 
that So must be invertible, since S _1 exists. In this case we can re-write Q as Q = S _1 (nS _1 + Sg ) _1 S~ . 
We next evaluate e Afc ( Ss; "), assuming that So is invertible. Substituting dSTl ) and d98l into e Afc ( Ss;n ) yields 

T M 

c A fc (Q s ;n) = JJ JJ e < fc , m *oo t , ktm -E5=o("t^m) HE a , (101) 

t=l m=l 
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with S = (nXT 1 + S^ 1 )- 1 and S a = £ for a = 1, . . . , n. In ( flOlI a t , fc , m e C Wt is defined as a t ,k, 
53™=o ^a 1<jJ tkm- I n or der to linearize the quadratic form a^ k m 'Eoat t k,m in ( HOIK we apply the identity 

J Tr^detSo -*' fe ' m ' 
to ( fTOlT ). with y , e C Wt . Then, we obtain 

—t .k.m 



(102) 



with 



Cn = [Tr^detlS^-^S + nSo)}] . 



(103) 



(104) 



In (H03]>, q(y k \^ { k a} ), with £>. = {y : for all t e T T , m} and Q£ 0> = {^\ a k i m : for all t e T T , m}, is defined 



M _ r, ,M 



T M 

4=1 m =\ 

.1°} . -*p 1 ic u„ m ,..{<»} 



(105) 



where g T N{y_ t km ~ w££ m ; S a ) is given by (Q), with wj,fc,m defined as {75]l. 

We solve the fixed-point equation ( l9TT > by substituting ( 1103b into d9TT >. It is sufficient to evaluate Q —M—M H +Q 
and Qi — Q owing to the fact that S and X depend on Q s only through them. Comparing both sides of d9"Tl ), 
we have 



T M 



Qo- m-m»+q =i^j:y: e e 



fc^K: t=l m=l 



E|e A fc(S 8 ;«)| 



,1°> Z,.! 1 ! U 



A"\% m - U]L)) H <i{yMk } ) (v } {v(yM 1} )\ x k ] } n dy 



(106) 



A I 



K^too K 



k£K t=l m=l 



-M 



E 



Uul 1 } - (uj 1 } )) 

\\ t,k,m \ t.k.ml I 



In ( 11061 ) and j 107b . E[exp{A fc (Q s ; n)}] is explicitly given as 



E 



,A fc (S a ;S) 



= CJ M E 



z&m< 0} ){ 



X), 



} ^ 



(107) 



(108) 



Furthermore, (/(<*4lfcm)) f° r a function /(u>j^ m ) of (TTSb denotes the mean of /(w+^ m ) with respect to the 



{i} 



posterior measure q{H^ \y_ k , X^dHj, 1 ^, defined as 

m { k a} \y k ^k) 



i(yM a} )pW { k a} ) 
/^X a} M^ a} )^i a} ' 



(109) 



where <zQ?Jfifc ) is given by ( 1105b . Equations j99l , ( 1 100b , ( 1106b , and ( 1107b form closed equations for S and X, 
and are well-defined for n 6 R. Regarding n in these equations as a real number and taking h — > +0, we obtain 
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the coupled fixed-point equations 



So = N I NT + lim | E E E E [( w gj m - J)«^ m - (u,$J) 

fl r M 

S = JVoI^ + ^ | E E E E [(("Sm - <«S J)("Sm " («S J) J ) 



(HO) 



(111) 



k$K t=l m=l 

where the expectations in (II 10b and (111 lb are taken with respect to the measure q{y_ k \^ k °^)p('H k ^)p(X k )dy_ k 
dU { k } dX k . Note that ( TTTOl and ( ITTTb are coupled since their second terms depend on both So and S. 

Let us assume So = S. Then, it is straightforward to show that the coupled fixed-point equations dl 10b and (II 1 11 1 
reduce to the single fixed-point equation 



So — N I N 



r+ A n L f £ £ E E Kin - <«$ j)(«$ m - J)* 



(112) 



fcd/C t=l m=l 



In (111 2b . the expectation is taken with respect to <?Q^ fe l^fc )p(^fc)^Vt. d1l k dX k . Furthermore, (u)f2 m ) 

denotes the mean of with respect to the posterior measure q('H i [ ^ \y_,, X k )dT-6 k °^ , given by d 1 09b . Substi- 

tuting (TT5b into dl 12b after assuming So = g% t (t)I at t , we find that dl 12b reduces to (l38l) . 

Replica Continuity 

We evaluate (l80l >. (f86b . and ( |89l l under Assumption [2] Substituting ( TTTb and d96l > into (f80b and subsequently using 
and (1 100b . we obtain 



(113) 



G T {Q s ;n) = -{n - 1) IndetS - lndet(S + nS ) - niWlriTr. 
Next, from ( 1103b . the moment generating function d86l l for the users in K, reduces to 

(JtM 

ae{l}UA k 



M k (Q s ;n) 



P (ni 1}UAk ) 



i(yM k 0} ) IT i(K{ a} \y k ,x k ){E nili [ q (y k \n k 1} )\x k ]} n dy k 



(114) 



where q(%\^ \y_ k , X k ) id given by j 109b . Finally, calculating the first term on the right-hand side of d89b . we have 
HQs, Qs] n)=~Tr [I Nt - iV (S + nSo)" 1 - ^oS" 1 + + nS )- 1 S S- 1 ] 



lim — VlnE(e A *^ ; ™n , (115) 



in the limit K — > oo, in which M[cxp{A k (Q s ; h)}] is given by ( 1 108b . 

Equations dl 13b . dl 14b . and dl 15b are well-defined for n € R. Let us assume that they coincide with the true 
ones for n E [0, n c ) with some n c > 0. Then, substituting dl 13b . dl 14b . and dl 15b into d9~2b and taking n — > +0, 
we arrive at 



.lim Ha^,^) - f .lim At) J] 1 



m { k 1] \t k ,x k ) n i(K a} \y k ,x k 



a£A k 



o(K), 



(116) 
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where the expectation is taken with respect to the measure q(^ k \fl k °^ )p(T-L^ )p(Xk)dy_ k d'H k ^ dXk- We apply 



Tib to 



lim 



to obtain 



(^llrJll II P^r\lT, 



q& k 1} \y k ,x k ) J] q(H { k a} \y k ,x k ) 

a£A k 



(117) 



where qin^ 1 \y_ k , X^) is given by (|109b . It is straightforward to find that the right-hand side of ( II 171 i is equal to 
that of (l63l >. In the derivation of (11 17b , we have used the fact that liirtfj^+o Df, should be equal to 1 due to the 
normalization of the pdf (II 17b . We omit the proof of limfi_).+o Dj% — 1 since it requires complicated calculations 
and is beyond the scope of this paper. Furthermore, the proof of the convexity of (|94| i at the saddle point is also 
omitted for the same reason. For details, see ifTSl . 

Appendix D 
Derivation of Proposition [TJ 

A. Formulation 

We analyze the equivalent channel between the users in JC and their decoders in symbol period t(> r) to derive 
Proposition Q] Our analysis is based on the replica method, which is basically the same as that for Lemma HI 
presented in Appendix [C] However, there are two differences between the two replica analyses. One is that we 
replicate not only the channel vectors but also the data symbols. The other is that the self-averaging property of 
MAI with respect to X-j- t is shown by using Lemma Q] 

It is sufficient from Assumption[T]to show that the distribution of Btjz. conditioned on B^jc, H, and Ij- t converges 
in law to the right-hand side of (l36l l in the large-system limit. We first transform p(Bt : K.\Bt : K, : / H : lT T ) mt0 a formula 
corresponding to ( T70b . The posterior pdf of &t t K. postulated by the optimal detector, defined in the same manner as 
in (fl4l >. is given by 

,« ,-v c r ^ fP<yt=yt\Bt,S t ,I T MBt)dB tAK 



!p{y t = y t \B u s t ,i TT ) P {Bt)dBt ' 

with B ti \ K = {B t: k ■ for all k <£ JC}. In (1X151 . the pdf p(y t \B t ,S t ,T Tr ) is given by <Q3) with X t _ x = T T . The 
equivalent channel between the users in JC and their decoders is represented as 



p(B t ,,c\Btx,n,lT T 



(119) 



p(B t , K \yt,s t ,i n )p(yt\n,s t ,Bt)p(B tAK )dy t dB tAK , 

with B t> \ K = {B t ,k ■ for all k <£ JC}. In ( fTT9] >, the pdf p{y t \U,S t ,B t ) represents the true MIMO DS-CDMA 
channel (|2). Introducing a real number n, we obtain 



lim p{B t .K\Bt.K,T-L,lr T 



with 



E. 



Si 



lim lim E n (B t ,ic,Bt,ic,'H,l%.), 

if.L— >oo n— >+0 



P$t = yt\H,S t ,B t )p(B t )dB t p(H\lr T )dH 



(120) 



where the posterior pdf p(i-L\Xx T ) is given by ( fl3l l with l t -i = I T - 



(121) 
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B. Average over Quenched Randomness 



We evaluate ( 11211 ) only for n <E N in the large-system limit. Let b\ °^ = {b\ a ^ m G C : for all m} and H k °' i = 



da} 



■x/M 



{h 



l N : for all m} denote replicas of B±,k and Hk for a = 2, 3, . . ., respectively: {b\ ^ } are independently 
drawn from p(Bt,k) for all fc and a, and {?i^ a ^} conditioned on X-f r are mutually independent random vectors 
following p(Hk\XT T ), defined by (lilt , for all fc and a. For notational convenience, we introduce B^) = Bt,k, 



B 



{i} 
t.k 



B t ,k, H 



Hk, and 



H k . Note that "H< a} and H\ a} for a 



t,fe 

, n should not be confused 



with each other. {H.j, a ^} conditioned on X-y T are mutually independent for all k, while {H^} conditioned 011X7; 
have dependencies for all k. Taking the averages in (1121b over y t and S t in the same manner as in the derivation 
of (|79l i, we have 



E n (B tx ,B tx ,H,lrr)=p(Btx^ 



3 iGi(St;n) 



Bt,K,B t ,)c,H,2T T 



(122) 



where Gi(Q t , u) is given by ( T80b . In ( 1122b . the positive definite Hermitian matrix Qt E M 



(n+l)N 



is given by 



Qt = 



K M 



,(c) 



(o) \ff 



fc=l m— 1 



£ (w £ ) 

,/c,mV t,k,mJ 



with u; 



(c) 



in which w 



(c),{a} 
t,k,m 



iN 



is given by 



(c),{o} _ £M,{a} 
J t,k,m ~ " / k,m u t,k,m- 



(123) 



(124) 



C. Average over Replicated Randomness 

We next evaluate the expectation in ( 1122b with respect to Q t . In the same manner as in the derivation of 
the pdf of Q t conditioned on Bt,K, Bt,K, H, and Ij- T is evaluated as 



A' 



[(n+l)Af 



n^ c) (c*;»)U- Jf ^ )(at,6tin) de« 



(125) 



where we have used the fact that {H 



1} conditioned on Z7; are mutually independent for all k. 



In (1125b . Qt denotes an (n + 1)N x (n + l)iV non-singular Hermitian matrix, defined in the same manner as in 
p. The integration in ( 11251 ) with respect to each element of Q t is taken along an imaginary axis. The moment 



generating function M^ c \Q t ; n) for the users in K is defined as 



M^\Q t ;n) =E 



Bt,k,Bt,k,Hk,Z% 



(126) 



with 



(127) 



Furthermore, the function 1^ (Qt, Qt',n) is given by 

4 C) (Q*, Gt;n) = TT(Q t Q t ) - 1 £ In 



AL c) (S f ;n) 



e * 



(128) 



Note that the second term of the right-hand side of ( 1128b depends on H and Tj- T , whereas that of (T89b is a 
deterministic value. We use Lemma Q] to show that the second term on the right-hand side of ( 1128b converges 
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in probability to a deterministic value in the large-system limit. We re-write E[exp{A£ c (Qt] n)}\7~Lk,'£j T ] as 
X k (H k ,l TT ), given by 



~ n 

x k (H k) i TT ) = J .h(n k ,{ni a} }) n {p(ni a} \ir,)dn { k a} } 



with 



fk(n k ,{u { k a} }) = K 



AL c) (Q t ;«) 



e * 



n k ,{n k a} } 



(129) 



(130) 



where {n{ a} } = {% { k a} 



= l,...,n} denotes all replicas of Hi for user k. Lemma [T] implies that 



{X k (H k ,Xj- T ) : for all k} converges in law to uncorrected random variables X_ k (H k ,I T ^ k ) in the large-system 
limit, given by 

„ n M 

X k {U k ,l TT , k ) = J h{H k ,{n { k a} }) II IT {P(fc*,m = h[%\l TT<k<m )dh { k a l) , (131) 

a— 1 m— 1 

with Xj- k = {Z.r T km '■ f° r a H TO I- Fr° m the weak law of large numbers, we find that ( 1128b converges in probability 



to 



I {c) (Qu Ct;n) = Tv(Q t Q t ) - lim^ 1 ^E [lnE {e A * 0) | ?4,I rT)fc } 



(132) 



in the large-system limit. The convergence in probability of ( 1128b to ( 11321 ) allows us to use the same method as in 
the derivation of j92l . Consequently, ( 11221 ) yields 

E n {B t , K ,B t , K ,H,l Tr ) = D^p(B t , K ) { J] MfCsfV)} e -** (0, (eM ,, i»>[l + o(X )], (133) 

with 

$ (c) (Qt,Q t ;n) = ^(Ct.fitjnJ-^GiCGtjn). (134) 



In (E33), Dl c) is given by £>i c) = [detV|, $ (c) (Qt S) , Q^S ™)]~ 1/2 | detV% I {c) (Q { t s \ Q^ ;n)\- 1 / 2 , with the 
Hesse matrices V 2 a $( c )(q| s) , q| s) ; n) and / (c) (Q^\ ©F; n) of ( fl34l i and ( fl32l with respect to Q t and Q t 
at the saddle-point (Q t , Q t ) — (Q$ , Qt )> respectively, which is a solution to the coupled fixed-point equations 



A I 



Qt = Hm -V V E 

if->oo K ^ ^ 



E{u4;U w S,J HeA 


L c) (Q t ;«) 




E | e A< o) (S t ;n) 





k£K.m=l 

Qt = -f3-\l + Ax{n)Q t )~ x Ai{n). 



(135) 
(136) 



If the coupled fixed-point equations ( 11351 ) and J 1 36b have multiple solutions, the solution to minimize ( 1 134b is 
chosen. 



Remark 5. A non-negative- entropy condition would be defined via the entropy of ( 17251 ) ;/ ( 17231 ) is discrete or if 
the data symbols and the channel vectors were discrete random variables M8V . However, it is unrealistic to assume 
that the channel vectors are discrete. Thus, the conventional non-negative-entropy condition is not defined for the 
no-CSI case. A non-negative-entropy condition might be defined via the entropy for the pdf of Q t marginalized 
over the channel vectors. However, its calculation is not straightforward. 
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D. Replica Continuity 

The evaluation of d 133b . ( 11351 ). and ( 11361 ) under the RS assumption is almost the same as in the derivations of 
( II 17b . ( II 10b . and (II 1 lb . Therefore, we omit the details and only present the results: 



. M 

K ^ a P{B t jc\B t jc,nM = II J 90t,k\y t , k M I] {9N(y t ^ m -h k;m b tXm ;^)d^ km }, (137) 



keK. m=l 

for y_ t k m G C^. In (TOTT i. g N (y t k m - h ktm b t>ktfn ; 5$ } ) is defined as ©. Furthermore, q{B t , k \y t k ,l TT ), with 

y.t,k = {y t ,k,m : for a11 is given by 

J rim=i5 , n(y tj£ .„ l - h Km b t .k.. m ]^ y >)p(Bt,k)p(H k \lT T )dBt, k dHk 
In these expressions, (Eg , , S^) is a solution to the coupled fixed-point equations 



s t} = JV, 



M 



k$Km=l 

a M 



k£ICm=l 



(139) 



(140) 



where </^&^ m ) denotes the mean of h{%b^ m with respect to g(6^ m ,h^||/ t)fe)m ,X 7;)fcjm )d^ ) fc )m dhfe, m , 
given by 

w ~{a} 9N{y t k m - hj%b$ jTn ; Si t) )p(6S TO )p(fefc°igr T ,fc,m) 

J 5w(y t fc m - K,m b i, k>m ; ^a)p( h llJp(K,J^T T ,k, m )dbll m dh kim 

(141) 

with = In the right-hand sides of d 1 39b and ( 1140b . the expectations are taken with respect to the measure 
9 N (Ut,k,m ~ h k, m btXrn)^o)dy^ k Jp{b tXm ^ 

Let us assume = Then, the coupled fixed-point equations ( 1 139b and ( 1140b reduce to the single fixed- 
point equation 

t M 

S = N In + lim — 7 7 IE \{h km bt, k , m — (hk, m bt,k,m))(hk,m,bt,k,m — {hk, m bt,k,m)) H \ ■ (142) 

K — too J\ ^ — ' ^ — ' 

In ( 1142b . (hk,m,bt,k,m) denotes the mean of h k ^ m b t .k.m with respect to the posterior pdf (130b . Furthermore, the 
expectation is taken with respect to the same measure as that for ( 1139b . It is shown in Appendix [E] that (1142b reduces 
to d3~7l i under the assumption of = <j^In- Furthermore, it is straightforward to find that ( 1137b is equivalent to 
(f3~&b under Assumption Q] 

E. Multiple Solutions 

We consider the case in which the coupled fixed-point equations ( 1 139b and ( 1140b have multiple solutions. In this 
case, we should choose the solution minimizing ( 1134b under the RS assumption for n € [0, e), with a sufficiently 
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small e > 0. Since lim„_>. + o $( c )(Q^ s) , Qt ;n) = 0, that solution is given as the solution minimizing the derivative 
of ( 11341 ) with respect to n in the limit n —> +0: 



F rs ^ lim J-^(Q<C\Q?;n)= lim f*<»> (sf , 

n->+0 On n->+0 on V — — 



n , 



(143) 



with — lim„_j. + o Qf 1 and ' = lim n _> + o Q t ' , in which we have used the stationarity condition (11361 ) to 
obtain the last expression. 

We calculate ( f80b and ( 11321 ) in ( 11341 ) under the RS assumption in the same manner as in the derivations of ( II 131 ) 
and ( 11 151 ) and subsequently differentiate the obtained formula with respect to n in n — > +0, to have 



lim 



dGi 
dn 



(Q| s) ;-) 



= -lndet£ (t) - Tr 



iVlnTT, 



(144) 



lim 



a/ 



= lim EE[C fc , m ]ln2-iTr[l w -2iV (sW)- 1 +7V (S«)- 1 sW(S( t ))- 1 ' 



(145) 



with 



C k 



9N(y t , —hk )p(bt,k,m) log 



— t.fe.m 



i.%-,k,m 



d y-t,Km dbt < k > m > 



(146) 



where the conditional expectation is taken with respect to hfe iTO and b t: k,m- Substituting ( 1144b and ( 11451 ) into ( 11431 ). 
we obtain 



T^F rs = lim |: V VE[c\ m ]+2^ JAH|S] (t) )+i^^^ 
in z « — >oo A * — ' * — ' 

k$Km=l 

iVlog(TreiVo). 



71/ 



(147) 



It is straightforward to find that (1147b reduces to d39l with the exception of the constant N log(7reiVo), by substituting 

(*: 





£<M = S (t) = a 2 J N into (fl47l . 



Appendix E 
Derivation of (l37l ) 

We assume Sq = a 2 In- Taking the traces for both sides of ( 11421 ) divided by N, we obtain 



cr 2 = 7V + lim — V V MMSE 



M 



t.k.: 



with 



MMSE t>fe>ro = E \\\h k , m bt,k,m - (hk, m b t ,k, m )\\ 2 ] • 
In (11491 ). (hk. m bt,k.m) denotes the mean of hk,mbt,k,m with respect to the posterior pdf ( f30b . 



(148) 



(149) 
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We first calculate the conditional joint pdf p(y ,hk, m \bt,k,m,X.%. km) to evaluate the posterior mean of 



hi 



.m^t.k.m 



The channel vector hk, m conditioned on Tj- km follows CJ\f(h km , ?In), with h km and £ 



?(a 2 r (r), r) given by (|3TT > and (|32] |. respectively. On the other hand, (|27T > yields 

llM t ft™ ~ ^fc,m6t,fe,m|| 2 ' 



P(y tjfe . m l^fe,m^t,fe,m) 

Calculating p(y_ tkm ,h k 



1 



■ exp 



p{y t k m \hk, m ,bt,k, m )p{hk, m \ZT^k^ m )^ we obtain 



p (Mt fe m i h k,m\bt,k,m,ZT T ,k,- 



- p(h k , m \y t k m ,b t ,k,m,Irr,k,m)P(y t km \bt,k, m ,! TT ^ m ), 



(150) 



(151) 



with 



P(h k , m \y t km ,b t ,k,m,Zr T ,k,m) 



(P/M)e+a 2 c 

2„2 



h 



k.rri 



:{bt,k,' 



tt?<j, 



(P/M)£ 2 + ct c 2 y " l ' K ' mj -t^™ (P/M)? + al m ' m 
where we have used the fact that \b t ,k,m\ 2 equals P/M with probability one. By definition, the posterior mean of 
hk, m bt,k,m is given by 



exp 



(p/M)e + °i 



Tt 



(152) 



(h 



(P/M)? 



(P/M)? 



2M.t,k,m 



(P/M)? 



Thu m {bt,k,m)- 



We next evaluate ( 1149b . Substituting (|27| and ( 1153b into ( 1149b yields 

„2 



MMSE t , fe , r 
where c t ,k,m is defined as 

Ct,k,i 



Ct,k.T 



■ % 



(P/M)? + a- 



(P/M)? + a< 



-(hk,m - h.k,m)bt,k 



hh,m(bt,h,m - (bt,k,m)) 



(P/M)? 



(P/M)? + a, 



2 —t,k,m- 



(153) 



(154) 



(155) 



The fact that Ct t k,m and y t fc m conditioned on b tt k,m and Ij- t k m are jointly CSCG is useful for showing 
that the two terms on the right-hand side of ( 1154b are mutually independent under the same conditions. The 

* Tt 

means of c ti k,m and y t m conditioned on bt t k,m and k m are zero and h.k,mbt,k>m> respectively. Also, the 
covariance matrix of (cf km , yJ km ) T conditioned on bt,k,m and 1j- t km is evaluated as the diagonal matrix 
d^g{(P/M)?a 2 /{(P/M)? + a 2 }I N , {(P/M)? + a 2 }I N }. Therefore, c t , k ,m and y conditioned on b t ^ m 



and X 



TT,k,m 



are mutually independent. This fact indicates that the two terms on the right-hand side of (11541 



conditioned on 6t,fc,m and I-j- k m are independent of each other, since the second term on the right-hand side of 
( 1154b conditioned on bt k m and 2V h m is a function of y , . Hence, from ( 1154b . we have 

' ' — /t^,'"' —t.k.m 



MMSE t , fe , 



N(P/M)?a 2 ( 



E 



(P/M)?+a 2 \(P/M)? + a 2 
We substitute ( 1156b into the fixed-point equation (1148b to obtain d37b 



T 



ll^fc.mll \bt,k,m - (bt,k,m)Y 



(156) 
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Appendix F 
Proof of Proposition^ 

We take the expectation of (l36l l for JC — {k} with respect to p(Hk\%T T ) t0 obtain 

lim p{B t ,k\Bt,k,lT T iS t ) = p(B t k = Bt,k\Bt,k,Ir T ) inlaw. (157) 

K,L— yco 

Applying this expression to (ETT l. we have 

lim C sep = /? (l - ^) lim I(B T+1 . 1: y T+11 \l TT ), (158) 

where we have used the fact that B t k contains all information about the received vectors y. . in the single-user 
SIMO channel (|27l i for the estimation of B 4 

In order to show that the right-hand side of ( 1158b is equal to ( f4Tb , we regard the conditional pdf £>Q^ k \Bt,kt^T-r) 
as a random variable and write it as Xk(I%-', ©) > 0, with G = {Bt.k, {lit k m '■ f° r a ^ m }}' gi ven by 

X fe (Ir T ;e) = J f k (Hk;®)p(Hk\lT T )dHk, (159) 



with 



fk{Hk\&) = Y[ P(y t ,k,m = h k,mh,k, m + Ih,k, m \ h k, m ,bt,k, m ), (160) 
m— 1 

where p(y. , |fofe,m> &t,fc,ra) represents the single-user SIMO channel ( |27] >. There exists the moment generating 
function of ( 1159t since J159l l is bounded. Then, Lemma [T] implies that X k ~ llm=i ^Ofe m(2i-TT,k,mi ®) gi yen ©> 
defined as 

^fc,mSr T ,fc,m;Q) = J P(Et tktm = h k,rnb t ,k,m + Ih,k, m \hk,m,hXrn)p{hk,m\Zr T ,k,n) dh k,m- (161) 

In evaluating (116 It . of = cr^ r (r) for t £ %■ is given by the solution to the fixed-point equation d38l . Applying this 
result to ( 1158b . we find that the right-hand side of ( 1158b is equal to (ETI i. 

Appendix G 
List of Several Sets 

Table [TT] lists several sets used in this paper. The other sets, such as y, lj- t , and so on, are defined according 
to the rule described in Section iLAl The indices of chips, symbol periods, users, transmit antennas, and replicas 
are denoted by I, t, k, m, and a, respectively. The indices I, t, k, and m move from 1 to the spreading factor L, 
the coherence time T c , the number of users K, and the number of transmit antennas M, respectively. The index a 
runs from to n (n) for Appendix ICl (Appendix iDt. which denotes the number of replicas. In this paper, indices 
themselves have meanings, as noted in Section iLAl For example, X t should not be confused with Xk- The former 
denotes the pilot symbols in symbol period t, while the latter represents all pilot symbols for user k. 



April 21, 2011 



DRAFT 



IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. , NO. , 2010 



47 



TABLE II 
List of several sets. 



Sets 


Definitions 


Eqs 


K 


finite subset ot {1, . . . , K} 




Tt 


11,..., t\ 




Ct 


jr., . . . ,T r > 




J 


disjoint subsets of {2, . . . , n} 


ED 


yt 


r„ c (pJV . e nr oil n 
* t t ■ ior dii tj- 




yt 


S *i a ir*N . fw- oil /\ 
f t . ror all tj- 


(LZ) 


c 

Ot 


{$1 t k m £ C : for all i, fc, ?n} 


l9l 

Ui 


rik 


/Ji, C (P-^ ■ fnr nil -ml 




H k 


replica of 




H k 






rik 


"t**fc m t ■ ror ail TTij- 




H k 


replica of 




n k 








{ x t km £ C : for all m} 


rj 


B t ,k 


{&t,fc,m G C : for all m} 


@ 


B t ,k 


{h,k, m e C : for all m} 




u t,k 


replica of B t 




°t,k 


B t ,k 




U t 


{ u t km S C : for all k, m} 


Gt 


It 


{y t , St, u t } 




% 


{yt, s t } 




y t , k 


\y eC N : for all m} 


ED 


—t , k , m 




ED 
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